A HARDY FIELD EXTENSION OF SZEMERÉDI'S THEOREM

NIKOS FRANTZIKINAKIS AND MÁTÉ WIERDL

Abstract. In 1975 Szemerédi proved that a set of integers of positive upper density contains
arbitrarily long arithmetic progressions. Bergelson and Leibman showed in 1996 that the
common difference of the arithmetic progression can be a square, a cube, or more generally
of the form $p(n)$ where $p(n)$ is any integer polynomial with zero constant term. We produce
a variety of new results of this type related to sequences that are not polynomial. We show
that the common difference of the progression in Szemerédi's theorem can be of the form $[n^\delta]$
where $\delta$ is any positive real number and $[x]$ denotes the integer part of $x$. More generally, the
common difference can be of the form $[a(n)]$ where $a(x)$ is any function that is a member of
a Hardy field and satisfies $a(x)/x^k \to \infty$ and $a(x)/x^{k+1} \to 0$ for some non-negative integer
$k$. The proof combines a new structural result for Hardy sequences, techniques from ergodic
theory, and some recent equidistribution results of sequences on nilmanifolds.

1. Introduction and main results

1.1. Introduction. In 1975 Szemerédi ([45]) answered a long-standing question of Erdős and
Turán (1936, [20]), showing that a set of integers of positive upper density$^1$ contains arbitrarily
long arithmetic progressions, or equivalently, for every $\ell \in \mathbb{N}$, patterns of the form

(1) $\{m, m + d, m + 2d, \ldots, m + \ell d\}$

for some $m \in \mathbb{Z}$ and $d \in \mathbb{N}$. This result has been very influential: several different proofs and
extensions have been found, and the tools developed in the process have led to applications in several
diverse fields, including combinatorics, number theory, harmonic analysis, ergodic theory,
and theoretical computer science.



2000 Mathematics Subject Classification. Primary: 37A45; Secondary: 28D05, 05D10, 11B25. 
Key words and phrases. Hardy fields, multiple recurrence, multiple ergodic averages, arithmetic progressions. 
The first author was partially supported by NSF grant DMS-0701027 and the second by DMS-0801316. 
$^1$ If $A \subset \mathbb{Z}$, the upper density of $A$ is the number $\bar{d}(A) = \limsup_{N\to\infty} |A \cap \{-N, \ldots, N\}|/(2N + 1)$. If the
previous limit exists we call it the density of $A$ and denote it by $d(A)$.

In this article we are interested in obtaining refinements of Szemerédi's theorem by restricting
the scope of the common difference $d$. During the last thirty years several related refinements
have been obtained, most notably a result of Bergelson and Leibman ([8]), who showed that
$d$ can be taken to be of the form $p(n)$ where $p$ is any non-constant integer polynomial with
$p(0) = 0$. This had been previously established for $\ell = 1$ by Sárközy ([44]) and Furstenberg
([25]). More examples, related to IP sets, generalized polynomials, polynomials with non-zero
constant term, and the set of prime numbers, can be found in [26], [TO], [39], [6], [21], [22]. All
these results were obtained using (in addition to other tools) methods that emerged from the
pioneering paper of Furstenberg ([24]), where ergodic theory was used to give a new proof of
Szemerédi's theorem.

We shall produce a variety of new examples given by sequences that are not polynomial,
and range from simply defined to rather exotic looking. For example, we shall show that the
common difference $d$ in (1) can be taken to be of the form



(2) [n^], $[n \log n]$, $[\sqrt{3}\, n^{5/2} + n \log n]$, $[n^2/\log\log n]$, f^n 2008 + (log n) 2 / 3 + nV
where $[x]$ denotes the integer part of $x$, or the form

(3) $[\log(n!)]$, $[\log(\Gamma(n^{3/2}))]$, $[n^2 \sin(1/\log n)]$, $[n^{5/2} \zeta(n)]$, [n 1+1 ^U(n)},

where $\Gamma$ is the Gamma function, $\zeta$ is the Riemann zeta function, and Li is the logarithmic
integral function (defined by $\mathrm{Li}(x) = \int_2^x 1/\log t \, dt$).

A more illuminating (but incomplete) description of the class of functions to which our
results apply is as follows: By $\mathcal{LE}$ we denote the collection of logarithmico-exponential
functions of Hardy ([29], [30]), consisting of all functions that can be constructed using the real
constants, the functions $e^x$ and $\log x$, and the operations of addition, multiplication, division,
and composition of functions, as long as the functions constructed are well defined for large $x$.
We shall show that if $a \in \mathcal{LE}$, then the common difference $d$ in (1) can have the form $[a(n)]$
as long as $a$ satisfies the growth condition $x^k \prec a(x) \prec x^{k+1}$ for some non-negative integer $k$.
The examples in (2) are of this type.

In fact, our result applies to the much larger class of functions that belong to some Hardy
field (a notion first introduced by Bourbaki ([18])) and satisfy the previous growth restrictions.
This will enable us to deal with the sequences in (3) as well.

Definition 1.1. Let $B$ be the collection of equivalence classes of real valued functions $a(x)$
defined on some half line $(u, \infty)$, where we identify two functions if they agree for all large $x$.$^2$
A Hardy field is a subfield of the ring $(B, +, \cdot)$ that is closed under differentiation. By $\mathcal{H}$ we
denote the union of all Hardy fields.

Hardy fields have been used to study solutions of differential equations ([H], [E], [15],
02]), difference and functional equations ([13], [14]), properties of curves in $\mathbb{R}^2$ ([19]),
equidistribution results of sequences on the torus ([TO]), and convergence properties of ergodic
averages (p2]). We collect some results that illustrate the richness of $\mathcal{H}$:

• $\mathcal{H}$ contains $\mathcal{LE}$ and anti-derivatives of elements of $\mathcal{LE}$.

• $\mathcal{H}$ contains several other functions not in $\mathcal{LE}$, like the functions $\Gamma(x)$, $\zeta(x)$, $\sin(1/x)$.

• If $a \in \mathcal{LE}$ and $b \in \mathcal{H}$, then there exists a Hardy field containing both $a$ and $b$.

• If $a \in \mathcal{LE}$, $b \in \mathcal{H}$, and $b(x) \to \infty$, then $a \circ b \in \mathcal{H}$.



$^2$ The equivalence classes just defined are often called germs of functions. We choose to use the word function
when we refer to elements of $B$ instead, with the understanding that all the operations defined and statements
made for elements of $B$ are considered only for sufficiently large values of $x \in \mathbb{R}$.

• If $a \in \mathcal{LE}$, $b \in \mathcal{H}$, and $a(x) \to \infty$, then $b \circ a \in \mathcal{H}$.

• If $a$ is a continuous function that is algebraic over some Hardy field, then $a \in \mathcal{H}$.

We mention some basic properties of elements of $\mathcal{H}$ relevant to our study. Every element of
$\mathcal{H}$ has eventually constant sign (since it has a multiplicative inverse). Therefore, if $a \in \mathcal{H}$, then
$a$ is eventually monotone (since $a'$ has eventually constant sign), and the limit $\lim_{x\to\infty} a(x)$
exists (possibly infinite). Since for every two functions $a \in \mathcal{H}$, $b \in \mathcal{LE}$ ($b \neq 0$), we have $a/b \in \mathcal{H}$,
it follows that the asymptotic growth ratio $\lim_{x\to\infty} a(x)/b(x)$ exists (possibly infinite). This
last property is key, since it will often justify our use of L'Hospital's rule. We are going to
freely use all these properties without any further explanation in the sequel.

1.2. Results in combinatorial language. The following is our main result: 

Theorem A. Let $a \in \mathcal{H}$ satisfy $x^k \prec a(x) \prec x^{k+1}$ for some non-negative integer $k$. Let $\ell \in \mathbb{N}$.
Then every $A \subset \mathbb{Z}$ with $\bar{d}(A) > 0$ contains arithmetic progressions of the form

(4) $\{m, m + [a(n)], m + 2[a(n)], \ldots, m + \ell[a(n)]\}$

for some $m \in \mathbb{Z}$ and $n \in \mathbb{N}$ with $[a(n)] \neq 0$.
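
As a purely illustrative finite experiment (it plays no role in the proof), the following Python sketch searches a finite set for a pattern of the form (4) in the special case $a(x) = x^{3/2}$, which satisfies the growth condition with $k = 1$ and whose integer part can be computed exactly as the integer square root of $n^3$. The names `hardy_diff` and `find_progression`, and the choice of test set, are hypothetical and made only for this illustration; a finite search says nothing about general sets of positive upper density.

```python
from math import isqrt


def hardy_diff(n):
    """Return [n^(3/2)] exactly: the integer part of sqrt(n^3)."""
    return isqrt(n ** 3)


def find_progression(A, ell, max_n):
    """Brute-force search for m in A and n <= max_n such that
    m, m + d, ..., m + ell*d all lie in A, where d = [n^(3/2)] != 0.
    Returns (m, n, d) for the first hit, or None if there is none."""
    members = set(A)
    ordered = sorted(members)
    for n in range(1, max_n + 1):
        d = hardy_diff(n)
        if d == 0:
            continue
        for m in ordered:
            if all(m + j * d in members for j in range(ell + 1)):
                return m, n, d
    return None


# Hypothetical test set: the multiples of 3 below 3000.
print(find_progression(range(0, 3000, 3), ell=3, max_n=50))
```

For this test set the search succeeds at $n = 7$, since $[7^{3/2}] = 18$ is the first value of the sequence divisible by 3; the progression found is $\{0, 18, 36, 54\}$.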

Remarks. • For $\ell = 1$, Theorem A can be easily deduced from the equidistribution results in
[16] and the spectral theorem (see [IT] for details).

• Our assumption can also be stated in the following equivalent form: $a \in \mathcal{H}$ has polynomial
growth and is not of the form $c x^k + b(x)$ for some non-negative integer $k$, non-zero real number
$c$, and $b \in \mathcal{H}$ that satisfies $b(x)/x^k \to 0$. So it is functions like $x^2 + \log x$ or $\sqrt{2} x^3 + x \log x$ that
our present methods do not allow us to handle.

• The assumption that $a(x)$ has polynomial growth is essential if one wants to have sufficient
conditions that depend only on the growth of the function $a(x)$.$^3$ On the other hand, the precise
assumptions on $a(x)$ in Theorem A can probably be relaxed (see Conjecture A in Section 1.5);
it certainly is possible for $\ell = 1$ (see Theorem C).

• An immediate corollary is the following coloristic result, which we do not see how to prove
without using Theorem A: If $a \in \mathcal{H}$ satisfies the growth condition of Theorem A, then every
finite coloring of the integers has a monochromatic arithmetic progression of the form (4).

• Although our result applies to rather exotic sequences, like the sequences mentioned in the
examples (2) and (3), simply defined sequences like [ri^\ seem to be almost as hard to deal
with as the general case.

• Unlike the case where $a(n)$ is a polynomial with zero constant term, it is not true that
$A \cap (A - [a(n)]) \neq \emptyset$ for a set of $n \in \mathbb{N}$ with bounded gaps. To see this, take $A = 2\mathbb{Z}$, $a(n) = \log_2 n$,
and notice that $[a(n)]$ takes odd values for every $n \in [2^{2l+1}, 2^{2l+2})$ for every $l \in \mathbb{N}$. With a bit
more effort one can show that we have the same problem for every $a \in \mathcal{H}$ that satisfies the
growth assumptions of Theorem A.
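
The unbounded gaps in the last remark can be observed numerically. The sketch below assumes that the logarithm is taken to base 2 (an assumption made here so that the bad intervals are exactly the dyadic ones), in which case $[\log_2 n]$ is one less than the bit length of $n$. The function names are hypothetical.

```python
def floor_log2(n):
    """Return [log_2 n] exactly for an integer n >= 1."""
    return n.bit_length() - 1


def longest_bad_gap(N):
    """Length of the longest run of consecutive n in [1, N] for which
    [log_2 n] is odd, i.e. for which 2Z and 2Z - [log_2 n] are disjoint."""
    longest = current = 0
    for n in range(1, N + 1):
        if floor_log2(n) % 2 == 1:
            current += 1
            longest = max(longest, current)
        else:
            current = 0
    return longest


# The longest bad run below 2^(2l+2) is the whole block [2^(2l+1), 2^(2l+2)).
for l in range(1, 5):
    print(l, longest_bad_gap(2 ** (2 * l + 2) - 1))
```

The run lengths printed double twice with each step in $l$, so the set of $n$ for which $A \cap (A - [a(n)]) \neq \emptyset$ cannot have bounded gaps.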

To prove Theorem A we first use the correspondence principle of Furstenberg (see Section 1.3)
to translate it into a statement about multiple recurrence in ergodic theory. The ergodic method
used to prove Szemerédi's theorem ([24]) and its polynomial extension ([8]) does not seem to
apply,$^4$ so we use a different method instead. Our argument splits into three parts:

$^3$ A result mentioned in [16] suggests the possibility that for every $a \in \mathcal{H}$ of super-polynomial growth there
exists $b \in \mathcal{H}$ of the same growth, that is, the limit of $b/a$ is a non-zero real constant, such that $b(n)$ is an odd
integer for every $n \in \mathbb{N}$. If this is the case, then no growth assumption on elements of $\mathcal{H}$ with super-polynomial
growth will be sufficient for our purposes.

$^4$ The main problem appears when one deals with distal systems. Unlike the case of a polynomial with zero
constant term, for $a \in \mathcal{H}$ satisfying $x^k \prec a(x) \prec x^{k+1}$ for some non-negative integer $k$, successive applications of

(i) As it turns out, dealing with the full sequence $[a(n)]$ greatly complicates our study, in
particular step (iii) below. Instead, we show that the range of $[a(n)]$ contains some suitably
chosen polynomial patterns of fixed degree (Proposition 5.1), and we work with this collection
of patterns henceforth. To obtain these patterns we use the Taylor expansion of the function
$a(x)$. Since some derivative of $a(x)$ vanishes at infinity, it makes sense to expect (but is
non-trivial to verify) that the range of $[a(n)]$ has a rich supply of polynomial progressions of fixed
degree.

(ii) For the polynomial patterns found in (i), we study the naturally associated multiple
ergodic averages, and show that the nilfactor of the system controls their limiting behavior
(Proposition 6.3). As a consequence, we reduce our problem to establishing a certain multiple
recurrence property for nilsystems. This reduction to nilsystems step is carried out using a
rather cumbersome application of the by now standard polynomial exhaustion technique (PET
induction); we use it to eliminate some undesirable constants and majorize our multiple ergodic
averages by some polynomial ones that we know how to control.

(iii) We verify the multiple recurrence property for nilsystems by comparing the multiple
ergodic averages along the polynomial patterns of part (i) with some easier to handle averages
that can be estimated using Furstenberg's classical multiple recurrence result. To carry out
the comparison step we need an equidistribution result on nilmanifolds (Proposition 6.4).
Because our polynomial patterns consist of finite polynomial blocks rather than a single infinite
polynomial sequence, the result we need does not seem to follow from the available
qualitative equidistribution results of polynomial sequences on nilmanifolds. Instead, we adapt a
quantitative equidistribution result that was recently obtained by Green and Tao ([28]).

To give an example of the polynomial patterns we are led to consider, let us look at the
case of the sequence $[a(n)]$ where $a \in \mathcal{H}$ satisfies $x \prec a(x) \prec x^2$. In this case, we can show
that for every $m \in \mathbb{N}$ the range of the sequence $[a(n)]$ contains arithmetic progressions with
common difference $m$ and length that increases to infinity as $m \to \infty$. Therefore, we can derive
Theorem A from the following result (that we find of interest on its own):

Theorem B. Suppose that for every $m \in \mathbb{N}$ the set $S \subset \mathbb{Z}$ contains arithmetic progressions of
the form $\{c_m + mn : 1 \leq n \leq N_m\}$ where $c_m, N_m$ are integers with $N_m \to \infty$. Let $\ell \in \mathbb{N}$.
Then every $A \subset \mathbb{Z}$ with $\bar{d}(A) > 0$ contains arithmetic progressions of the form

$\{r, r + s, r + 2s, \ldots, r + \ell s\}$

for some $r \in \mathbb{Z}$ and non-zero $s \in S$.

Remarks. • See Theorem 6.2 for a result that deals with more general polynomial progressions.

• If $c_m = 0$ for infinitely many $m \in \mathbb{N}$, the result follows easily from a finitistic version of
Szemerédi's theorem. Such an easy derivation doesn't seem to be possible when we have no
(usable) control over the constants $c_m$.
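
To make the hypothesis of Theorem B concrete, the following Python sketch looks, inside the sequence $[n^{3/2}]$ (an example with $x \prec a(x) \prec x^2$, chosen here for illustration), for the longest block of consecutive indices on which the values form an arithmetic progression with a prescribed common difference $m$. The function names are hypothetical, and the experiment only illustrates the phenomenon described above; it does not replace the argument based on the Taylor expansion.

```python
from math import isqrt


def a_floor(n):
    """Return [n^(3/2)] exactly as the integer square root of n^3."""
    return isqrt(n ** 3)


def longest_progression(m, n_max):
    """Among 1 <= n <= n_max, find the longest block of consecutive n on
    which [a(n)] is an arithmetic progression with common difference m.
    Returns (start, terms): the first index of the block and its length,
    or (0, 0) if the difference m never occurs."""
    best_start, best_terms = 0, 0
    start = 1  # first index of the current block
    for n in range(1, n_max):
        if a_floor(n + 1) - a_floor(n) == m:
            terms = n + 2 - start  # block covers start, ..., n + 1
            if terms > best_terms:
                best_start, best_terms = start, terms
        else:
            start = n + 1
    return best_start, best_terms


start, terms = longest_progression(10, 200)
print([a_floor(start + j) for j in range(terms)])
```

Running the search for increasing values of $m$ (with `n_max` enlarged accordingly) gives a hands-on way to observe the blocks whose existence, with length tending to infinity, is what Theorem B requires of the set $S$.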

To prove Theorem A we shall need a generalization of Theorem B that deals with more
complicated polynomial patterns (Theorem 6.2). In order to illustrate some of the ideas needed
to prove Theorem A in their simplest form, we choose to present the proof of Theorem B
separately.

Next we mention an improvement of our main result for $\ell = 1$. This result was first obtained
(but never published) several years ago by Boshernitzan using a method different from ours.

the operation $[a(n + m)] - [a(n)] - [a(m)]$, $m \in \mathbb{N}$, lead eventually to non-zero constant sequences (in $n$), which
is a problem when one tries to prove the corresponding coloristic (van der Waerden type) result.

Theorem C. Let $a \in \mathcal{H}$ have polynomial growth and suppose that $|a(x) - cp(x)| \to \infty$ for
every $p \in \mathbb{Z}[x]$ and $c \in \mathbb{R}$.

Then every $A \subset \mathbb{Z}$ with $\bar{d}(A) > 0$ contains $x, y \in A$ that satisfy $y - x = [a(n)]$ for some
$n \in \mathbb{N}$ with $[a(n)] \neq 0$.

The proof of Theorem C is rather different (and much easier) than the proof of our main 
result (Theorem A). In order not to digress from our main objective we give it in the Appendix. 

Although we were not able to prove Theorem A under the more relaxed assumptions of
Theorem C, we believe that the corresponding stronger statement should be true (see Conjecture A
below).

1.3. Results in ergodic language. Throughout the article we shall use the term measure
preserving system, or the word system, to designate a quadruple $(X, \mathcal{B}, \mu, T)$, where $(X, \mathcal{B}, \mu)$
is a Lebesgue probability space, and $T\colon X \to X$ is an invertible measurable map such that
$\mu(T^{-1}A) = \mu(A)$ for every $A \in \mathcal{B}$. The necessary background from ergodic theory is given in
Section 2.

We shall use the following correspondence principle of Furstenberg (the formulation given is
from [2]) to reformulate Theorems A, B, and C in ergodic theoretic language:

Furstenberg Correspondence Principle ([24], [2]). Let $\Lambda$ be a set of integers.

Then there exist a system $(X, \mathcal{B}, \mu, T)$ and a set $A \in \mathcal{B}$, with $\mu(A) = \bar{d}(\Lambda)$, and such that

(5) $\bar{d}(\Lambda \cap (\Lambda - n_1) \cap \ldots \cap (\Lambda - n_\ell)) \geq \mu(A \cap T^{-n_1}A \cap \cdots \cap T^{-n_\ell}A)$,

for every $n_1, \ldots, n_\ell \in \mathbb{Z}$ and $\ell \in \mathbb{N}$.

For convenience we give the following definition: 

Definition 1.2. If $\ell \in \mathbb{N}$, we say that the set $S$ of integers is a set of $\ell$-recurrence for the
system $(X, \mathcal{B}, \mu, T)$, if for every $A \in \mathcal{B}$ with $\mu(A) > 0$ we have

(6) $\mu(A \cap T^{-s}A \cap T^{-2s}A \cap \cdots \cap T^{-\ell s}A) > 0$ for some non-zero $s \in S$.

We say that the set of integers $S$ is a set of $\ell$-recurrence, or good for $\ell$-recurrence, if it is a set
of $\ell$-recurrence for every system. If $S$ is a set of $\ell$-recurrence for every $\ell \in \mathbb{N}$, we say that $S$ is
a set of multiple recurrence.

Remarks. • If $S$ is a set of $\ell$-recurrence, then (6) will in fact be satisfied for infinitely many
$s \in S$ (indeed, $S \cap m\mathbb{Z}$ is also a set of $\ell$-recurrence for every $m \in \mathbb{N}$).

• We get a similar definition for sequences of integers by letting $S$ be the range of the
sequence. In this case we say that a sequence is good for $\ell$-recurrence, or good for multiple
recurrence.

Let us give some examples of sets of multiple recurrence and also mention some obstructions
to recurrence. In the introduction we mentioned that if $p \in \mathbb{Z}[x]$ is non-constant and $p(0) = 0$,
then the sequence $p(n)$ is good for multiple recurrence ([8]). Other examples of sets of multiple
recurrence are IP sets, meaning sets that consist of all finite sums (with distinct entries) of
some infinite set ([26]), and sets of the form $\bigcup_{n\in\mathbb{N}} \{a_n, 2a_n, \ldots, na_n\}$ where $a_n \in \mathbb{N}$ (follows
from a finite version of Szemerédi's theorem).

Examples of sets that are bad for single recurrence are sets that do not contain multiples of
some positive integer, and also the range of lacunary sequences. It follows that the sequences
$3n + 2$, $n^2 + 1$, $p + 2$ ($p$ prime), $n!$, are bad for single recurrence. The set $\{n \in \mathbb{N} : \{n\sqrt{5}\} \in
[1/2, 3/4]\}$ and the sequence $[\sqrt{5}n + 2]$ are bad for recurrence for the rotation by $\sqrt{5}$ on $\mathbb{T}$ (on
the other hand the sequences $[\sqrt{5}n + 1]$ and $[\sqrt{5}n + 3]$ are good for single recurrence, see the
discussion in Section 7).
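
The first obstruction mentioned above can be checked mechanically on a finite system. The sketch below models the rotation $x \mapsto x + 1$ on $\mathbb{Z}/q\mathbb{Z}$ with the normalized counting measure (a choice of system made here for illustration) and tests whether the one-point set $A = \{0\}$, which has measure $1/q > 0$, ever returns to itself along a given finite list of times. The function names are hypothetical.

```python
def has_return(A, s, q):
    """True when A and T^{-s}A intersect, for the rotation T x = x + 1 on Z/qZ."""
    return any((x + s) % q in A for x in A)


def is_bad_for_rotation(S, q):
    """True when no non-zero s in S brings A = {0} back to itself under the
    rotation on Z/qZ, i.e. when S contains no non-zero multiple of q."""
    A = {0}
    return not any(has_return(A, s, q) for s in S if s != 0)


linear = [3 * n + 2 for n in range(1, 500)]
quadratic = [n * n + 1 for n in range(1, 500)]
squares = [n * n for n in range(1, 500)]

# 3n + 2 and n^2 + 1 are never divisible by 3, whereas n^2 is for n = 3.
print(is_bad_for_rotation(linear, 3),
      is_bad_for_rotation(quadratic, 3),
      is_bad_for_rotation(squares, 3))
```

For the first two lists no return occurs, which is the reason the sequences $3n + 2$ and $n^2 + 1$ fail to be good for single recurrence; the list of squares, by contrast, contains multiples of 3.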

Using Furstenberg's correspondence principle it is easy to see that the following result implies
Theorem A (in fact it is not hard to show that they are equivalent):

Theorem A'. Let $a \in \mathcal{H}$ satisfy $x^k \prec a(x) \prec x^{k+1}$ for some non-negative integer $k$.
Then $S = \{[a(1)], [a(2)], \ldots\}$ is a set of multiple recurrence.

The following result implies Theorem B:

Theorem B'. Suppose that for every $m \in \mathbb{N}$ the set $S \subset \mathbb{Z}$ contains arithmetic progressions
of the form $\{c_m + mn : 1 \leq n \leq N_m\}$ where $c_m, N_m$ are integers and $N_m \to \infty$ as $m \to \infty$.
Then $S$ is a set of multiple recurrence.

The following result implies Theorem C:

Theorem C'. Let $a \in \mathcal{H}$ have polynomial growth and suppose that $|a(x) - cp(x)| \to \infty$ for
every $p \in \mathbb{Z}[x]$ and $c \in \mathbb{R}$.

Then $S = \{[a(1)], [a(2)], \ldots\}$ is a set of single recurrence.

1.4. Structure of the article. In Section 2 we give the necessary background from ergodic
theory. Key to our study are some results about the structure of the characteristic factors of
some multiple ergodic averages.

In Section 3 we give the necessary background on nilsystems, and state some equidistribution
results of sequences on nilmanifolds. A crucial ingredient for our study is the quantitative
equidistribution result stated in Theorem 3.2 for connected groups. We generalize this result
to not necessarily connected groups in Theorem 3.4.

In Section 4 we prove Theorem B', which serves as a model for the more complicated result
that involves Hardy field sequences (Theorem A').

In Section 5 we carry out the first step needed to prove Theorem A'. We show that if $a \in \mathcal{H}$
satisfies $x^k \prec a(x) \prec x^{k+1}$ for some non-negative integer $k$, then the range of the sequence
$[a(n)]$ contains some conveniently chosen polynomial patterns.

In Section 6 we work with the patterns found in Section 5 and carry out the final two steps
of the proof of Theorem A'. The first step is a "reduction to nilsystems" argument. In the
second step we verify a multiple recurrence property for nilsystems. Our argument is similar
to the one used to prove our model result Theorem B'. The only extra difficulty occurs in the
"reduction to nilsystems" step, which happens to be technically much more involved than the
one needed for our model result.

The Appendix contains the proof of Theorem C. 

1.5. Further directions. Roughly speaking, the method used to prove Theorem A, or its
equivalent version Theorem A', amounts to finding conveniently chosen polynomial pieces
within the range of a sequence. These pieces should be chosen so that it is possible to (i)
carry out a reduction to nilsystems step, and (ii) verify a certain recurrence property for
nilsystems. In view of the tools that have recently surfaced and help us carry out steps (i) and
(ii), this "polynomial method" appears to be rather flexible, and is very likely to find further
applications. For example, it now looks within reach to show that every set of integers with
positive density contains patterns of the form $m, m + [a_1(n)], \ldots, m + [a_k(n)]$ for "most" choices
of functions $a_i(x)$ that belong to some Hardy field and have polynomial growth. This belief is
reinforced by recent extensions in [5] of the weakly mixing PET from [3].

A more challenging problem is to find an example of a Hardy sequence of super-polynomial
growth that is "good" for Szemerédi's theorem (i.e. the conclusion of Theorem A holds). For
example, is the sequence $[n^{\log\log n}]$ good for Szemerédi's theorem? Concerning convergence
results, if $a \in \mathcal{H}$ satisfies the growth assumptions of Theorem A, then it seems likely that the
range of the sequence $[a(n)]$ can be split into "polynomial pieces" that we can control, and
hence prove convergence in $L^2$ for the multiple ergodic averages

$\frac{1}{N} \sum_{n=1}^{N} T^{[a(n)]} f_1 \cdot \ldots \cdot T^{\ell[a(n)]} f_\ell.$

Since the growth assumptions in Theorem A can be relaxed when $\ell = 1$ (see Theorem C), it
seems very likely that the same should be the case for general $\ell$:

Conjecture A. Let $a \in \mathcal{H}$ have polynomial growth and suppose that $|a(x) - cp(x)| \to \infty$ for
every $p \in \mathbb{Z}[x]$ and $c \in \mathbb{R}$.

Then for every $\ell \in \mathbb{N}$, every $A \subset \mathbb{Z}$ with $\bar{d}(A) > 0$ contains arithmetic progressions of the
form

(7) $\{m, m + [a(n)], m + 2[a(n)], \ldots, m + \ell[a(n)]\}$

for some $m \in \mathbb{Z}$ and $n \in \mathbb{N}$ with $[a(n)] \neq 0$.

In view of the fact that Szemerédi's theorem on arithmetic progressions was a key
ingredient in showing that the primes contain arbitrarily long arithmetic progressions ([27]), and
likewise the polynomial Szemerédi theorem was key in establishing polynomial progressions in
the primes ([46]), the following result seems plausible:

Conjecture B. Let $a \in \mathcal{H}$ satisfy the growth assumptions of Theorem A (or Conjecture A).
Then the prime numbers contain arbitrarily long arithmetic progressions of the form (4).

1.6. Notational conventions. The following notation will be used throughout the article:
$\mathbb{N} = \{1, 2, \ldots\}$, $Tf = f \circ T$, $e(x) = e^{2\pi i x}$, $[x]$ denotes the integer part of $x$, $\{x\} = x - [x]$,
$\|x\| = d(x, \mathbb{Z})$, $o_{m_1,\ldots,m_k}(1)$ denotes a quantity that goes to zero when $m_1, \ldots, m_k \to +\infty$, by
$a(x) \prec b(x)$ we mean $\lim_{x\to\infty} a(x)/b(x) = 0$, and when there is no danger of confusion we write
$\infty$ instead of $+\infty$. We use the symbol $\ll$ when some expression is majorized by a constant
multiple of some other expression. We shall frequently abuse notation and denote the elements
$t\mathbb{Z}$ of $\mathbb{T}$, where $t \in \mathbb{R}$, by $t$.

Acknowledgment. The authors would like to thank T. Tao for helpful discussions related to
the material of Section 3 and the referee for helpful comments.

2. Background in ergodic theory 

Below we gather some basic notions and facts from ergodic theory that we use throughout
the paper. The reader can find further background material in ergodic theory in [25], [41], [48].

2.1. Factors in ergodic theory. A homomorphism from a system $(X, \mathcal{B}, \mu, T)$ onto a system
$(Y, \mathcal{D}, \nu, S)$ is a measurable map $\pi\colon X' \to Y'$, where $X'$ is a $T$-invariant subset of $X$ and $Y'$ is
an $S$-invariant subset of $Y$, both of full measure, such that $\mu \circ \pi^{-1} = \nu$ and $S \circ \pi(x) = \pi \circ T(x)$
for $x \in X'$. When we have such a homomorphism we say that the system $(Y, \mathcal{D}, \nu, S)$ is a factor
of the system $(X, \mathcal{B}, \mu, T)$. If the factor map $\pi\colon X' \to Y'$ can be chosen to be injective, then we
say that the systems $(X, \mathcal{B}, \mu, T)$ and $(Y, \mathcal{D}, \nu, S)$ are isomorphic (bijective maps on Lebesgue
spaces have measurable inverses).

A factor can be characterized (modulo isomorphism) by the data $\pi^{-1}(\mathcal{D})$, which is a
$T$-invariant sub-$\sigma$-algebra of $\mathcal{B}$, and any $T$-invariant sub-$\sigma$-algebra of $\mathcal{B}$ defines a factor; by a
classical abuse of terminology we denote by the same letter the $\sigma$-algebra $\mathcal{D}$ and its inverse image
by $\pi$. In other words, if $(Y, \mathcal{D}, \nu, S)$ is a factor of $(X, \mathcal{B}, \mu, T)$, we think of $\mathcal{D}$ as a sub-$\sigma$-algebra of
$\mathcal{B}$. A factor can also be characterized (modulo isomorphism) by a $T$-invariant subalgebra $\mathcal{F}$ of
$L^\infty(X, \mathcal{B}, \mu)$, in which case $\mathcal{D}$ is the sub-$\sigma$-algebra generated by $\mathcal{F}$, or equivalently, $L^2(X, \mathcal{D}, \mu)$
is the closure of $\mathcal{F}$ in $L^2(X, \mathcal{B}, \mu)$. We shall sometimes abuse notation and use the sub-$\sigma$-algebra
$\mathcal{D}$ in place of the subspace $L^2(X, \mathcal{D}, \mu)$. For example, if we write that a function is orthogonal
to the factor $\mathcal{D}$, we mean it is orthogonal to the subspace $L^2(X, \mathcal{D}, \mu)$.

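If $\mathcal{D}$ is a $T$-invariant sub-$\sigma$-algebra of $\mathcal{B}$ and $f \in L^2(\mu)$, we define the conditional expectation
$E(f|\mathcal{D})$ of $f$ with respect to $\mathcal{D}$ to be the orthogonal projection of $f$ onto $L^2(\mathcal{D})$. We frequently
make use of the identities

$\int E(f|\mathcal{D}) \, d\mu = \int f \, d\mu, \qquad T\, E(f|\mathcal{D}) = E(Tf|\mathcal{D}).$
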
The transformation $T$ is ergodic if $Tf = f$ implies that $f = c$ (a.e.) for some $c \in \mathbb{C}$, and
totally ergodic if $T^r f = f$ for some $r \in \mathbb{N}$ implies that $f = c$ (a.e.) for some $c \in \mathbb{C}$.

Every system $(X, \mathcal{B}, \mu, T)$ has an ergodic decomposition, meaning that we can write $\mu =
\int \mu_t \, d\lambda(t)$, where $\lambda$ is a probability measure on $[0, 1]$ and $\mu_t$ are $T$-invariant probability measures
on $(X, \mathcal{B})$ such that the systems $(X, \mathcal{B}, \mu_t, T)$ are ergodic for $t \in [0, 1]$. We sometimes denote
the ergodic components by $T_t$, $t \in [0, 1]$.

We say that $(X, \mathcal{B}, \mu, T)$ is an inverse limit of a sequence of factors $(X, \mathcal{B}_j, \mu, T)$ if $(\mathcal{B}_j)_{j\in\mathbb{N}}$
is an increasing sequence of $T$-invariant sub-$\sigma$-algebras such that $\bigvee_{j\in\mathbb{N}} \mathcal{B}_j = \mathcal{B}$ up to sets of
measure zero.

2.2. Characteristic factors for polynomial averages. Following [31],$^5$ for every system
$(X, \mathcal{B}, \mu, T)$ and function $f \in L^\infty(\mu)$, we define inductively the seminorms $|||f|||_\ell$ as follows: For
$\ell = 1$ we set $|||f|||_1 = \int |E(f|\mathcal{I})| \, d\mu$, where $\mathcal{I}$ is the $\sigma$-algebra of $T$-invariant sets. For $\ell \geq 2$ we
set

$|||f|||_\ell^{2^\ell} = \lim_{N\to\infty} \frac{1}{N} \sum_{n=1}^{N} |||\bar{f} \cdot T^n f|||_{\ell-1}^{2^{\ell-1}}.$

It was shown in [31] that for every integer $\ell \geq 1$, $|||\cdot|||_\ell$ is a seminorm on $L^\infty(\mu)$ and it
defines factors $\mathcal{Z}_{\ell-1} = \mathcal{Z}_{\ell-1}(T)$ in the following manner: the $T$-invariant sub-$\sigma$-algebra $\mathcal{Z}_{\ell-1}$ is
characterized by

(8) for $f \in L^\infty(\mu)$, $E(f|\mathcal{Z}_{\ell-1}) = 0$ if and only if $|||f|||_\ell = 0$.

We call $\mathcal{Z}_\ell$ the $\ell$-step nilfactor of the system. By $\mathcal{Z}$ we denote the smallest factor that is an
extension of all the factors $\mathcal{Z}_\ell$ for $\ell \in \mathbb{N}$, and we call $\mathcal{Z}$ the nilfactor of the system. If $f$ is a
bounded function that satisfies $E_\mu(f|\mathcal{Z}_\ell(T)) = 0$, then $E_{\mu\times\mu}(f \otimes \bar{f}|\mathcal{Z}_{\ell-1}(T \times T)) = 0$ (this is
implicit in [31]). Also, if $T_t$, where $t \in [0, 1]$, are the ergodic components of the system, then
$E(f|\mathcal{Z}_\ell(T)) = 0$ if and only if $E(f|\mathcal{Z}_\ell(T_t)) = 0$ for a.e. $t \in [0, 1]$.

The factors $\mathcal{Z}_\ell$ are of particular interest because they control the limiting behavior in $L^2$ of
several multiple ergodic averages. The next result makes this more precise.

$^5$ In [31] the authors work with ergodic systems, in which case $|||f|||_1 = |\int f \, d\mu|$, and real valued functions, but
the whole discussion can be carried out for non-ergodic systems as well and complex valued functions without
extra difficulties.

Theorem 2.1 (Leibman [37]). Let $p_1, \ldots, p_s\colon \mathbb{Z}^r \to \mathbb{Z}$ be a family of non-constant essentially
distinct polynomials.

Then there exists a non-negative integer $\ell = \ell(p_1, p_2, \ldots, p_s)$ with the following property: If
$(X, \mathcal{B}, \mu, T)$ is a system and at least one of the functions $f_1, \ldots, f_s \in L^\infty(X)$ is orthogonal to
the factor $\mathcal{Z}_\ell(T)$, then for every Følner sequence $(\Phi_N)_{N\in\mathbb{N}}$ in $\mathbb{Z}^r$ we have

$\lim_{N\to\infty} \Big\| \frac{1}{|\Phi_N|} \sum_{n\in\Phi_N} T^{p_1(n)} f_1 \cdot \ldots \cdot T^{p_s(n)} f_s \Big\|_{L^2(\mu)} = 0.$

We say that $\mathcal{Z}_\ell(T)$ is a characteristic factor associated to the family $p_1, p_2, \ldots, p_s$ when this
last fact is true.

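We shall also use the following easy corollary of the previous result:

Corollary 2.2. Let $(X, \mathcal{B}, \mu, T)$ be a system, $p_1, \ldots, p_s\colon \mathbb{Z}^r \to \mathbb{Z}$ be a family of non-constant
essentially distinct polynomials, and let $\mathcal{Z}_\ell(T)$ be a characteristic factor for this family. Suppose
that at least one of the functions $f_0, f_1, \ldots, f_s \in L^\infty(X)$ is orthogonal to the factor $\mathcal{Z}_{\ell+1}(T)$.
Then for every Følner sequence $(\Phi_N)_{N\in\mathbb{N}}$ in $\mathbb{Z}^r$ we have

$\lim_{N\to\infty} \frac{1}{|\Phi_N|} \sum_{n\in\Phi_N} \int f_0 \cdot T^{p_1(n)} f_1 \cdot \ldots \cdot T^{p_s(n)} f_s \, d\mu = 0.$

Proof. If $f_i$ is orthogonal to the factor $\mathcal{Z}_{\ell+1}(T)$, then as mentioned above, the function $f_i \otimes \bar{f}_i$
is orthogonal to the factor $\mathcal{Z}_\ell(T \times T)$. Therefore, by Theorem 2.1 the averages

$\frac{1}{|\Phi_N|} \sum_{n\in\Phi_N} \int f_0(x) \cdot \bar{f}_0(y) \cdot f_1(T^{p_1(n)}x) \cdot \bar{f}_1(T^{p_1(n)}y) \cdot \ldots \cdot f_s(T^{p_s(n)}x) \cdot \bar{f}_s(T^{p_s(n)}y) \, d\mu(x) d\mu(y)$

converge to zero. Since the $n$-th integral above equals $|\int f_0 \cdot T^{p_1(n)} f_1 \cdot \ldots \cdot T^{p_s(n)} f_s \, d\mu|^2$, the
advertised result follows from the Cauchy-Schwarz inequality. □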



Host and Kra ([31]) showed that the factors $\mathcal{Z}_\ell$ are of purely algebraic structure (a closely
related result was subsequently proved by Ziegler ([50])), a result that is crucial for our study.

Theorem 2.3 (Host & Kra [31]). Let $(X, \mathcal{B}, \mu, T)$ be a system and $\ell \in \mathbb{N}$.

Then a.e. ergodic component of the factor $\mathcal{Z}_\ell(T)$ is an inverse limit of $\ell$-step nilsystems.

This result justifies our name for the factors $\mathcal{Z}_\ell(T)$.



3. Equidistribution results on nilmanifolds 

In this section we give some background material on nilsystems and gather some
equidistribution results of polynomial sequences on nilmanifolds that will be used later. Nilsystems
play a central role in our study because they provide a sufficient class for verifying several
multiple recurrence results for general measure preserving systems. In fact, when one deals with
"polynomial recurrence" this is usually a consequence of Theorems 2.1 and 2.3. These two
results, taken together, show that nilsystems control the limiting behavior of the corresponding
polynomial multiple ergodic averages.

3.1. Nilmanifolds, definition and basic properties. The reader can find fundamental
properties of nilsystems related to our study in [1], [40], [38], and [36].

Given a topological group $G$, we denote the identity element by $e$, and we let $G_0$ denote the
connected component of $e$. If $A, B \subset G$, then $[A, B]$ is defined to be the subgroup generated by
elements of the form $\{[a, b] : a \in A, b \in B\}$ where $[a, b] = aba^{-1}b^{-1}$. We define the commutator
subgroups recursively by $G_1 = G$ and $G_{k+1} = [G, G_k]$. A group $G$ is said to be $k$-step nilpotent
if its $(k + 1)$ commutator $G_{k+1}$ is trivial. If $G$ is a $k$-step nilpotent Lie group and $\Gamma$ is a discrete
cocompact subgroup, then the compact space $X = G/\Gamma$ is said to be a $k$-step nilmanifold. The
group $G$ acts on $G/\Gamma$ by left translation where the translation by a fixed element $a \in G$ is given
by $T_a(g\Gamma) = (ag)\Gamma$. By $m_X$ we denote the unique probability measure on $X$ that is invariant
under the action of $G$ by left translations (called the normalized Haar measure) and by $\mathcal{G}/\Gamma$
the Borel $\sigma$-algebra of $G/\Gamma$. Fixing an element $a \in G$, we call the system $(G/\Gamma, \mathcal{G}/\Gamma, m_X, T_a)$ a
$k$-step nilsystem. We call the elements of $G$ nilrotations.

Given a nilmanifold $X = G/\Gamma$, an ergodic nilrotation is an element $a \in G$ such that the
sequence $(a^n\Gamma)_{n\in\mathbb{N}}$ is uniformly distributed on $X$. If $X$ is a connected nilmanifold and $a \in G$
is an ergodic nilrotation it can be shown that for every $d \in \mathbb{N}$ the nilrotation $a^d$ is also ergodic.

Example 1. On the space $G = \mathbb{Z} \times \mathbb{R}^2$, define multiplication as follows:
if $g_1 = (m, x_1, x_2)$ and $g_2 = (n, y_1, y_2)$, then

$g_1 \cdot g_2 = (m + n, x_1 + y_1, x_2 + y_2 + m y_1).$

Then $G$ with $\cdot$ is a 2-step nilpotent Lie group and the group $G_0 = \{0\} \times \mathbb{R}^2$ is Abelian. The
discrete subgroup $\Gamma = \mathbb{Z}^3$ is cocompact and $X = G/\Gamma$ is connected. It can be shown that the
nilrotation $a = (1, \alpha, \beta)$ is ergodic if and only if $\alpha$ is an irrational number.
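
The algebraic claims in Example 1 can be checked by direct computation. The following Python sketch implements the stated multiplication, with the real coordinates modelled by exact rationals (an assumption made here to avoid rounding); the helper names `mul`, `inv`, `commutator` and `power` are hypothetical. It shows that every commutator has the form $(0, 0, m y_1 - n x_1)$, hence is central, and that $a^n = (n, n\alpha, n\beta + \binom{n}{2}\alpha)$, so the orbit of a nilrotation has polynomial coordinates.

```python
from fractions import Fraction


def mul(g, h):
    """Group law of Example 1 on Z x R^2 (real coordinates as Fractions)."""
    m, x1, x2 = g
    n, y1, y2 = h
    return (m + n, x1 + y1, x2 + y2 + m * y1)


def inv(g):
    """Inverse of g, obtained by solving mul(g, inv(g)) == (0, 0, 0)."""
    m, x1, x2 = g
    return (-m, -x1, -x2 + m * x1)


def commutator(g, h):
    """[g, h] = g h g^{-1} h^{-1}."""
    return mul(mul(g, h), mul(inv(g), inv(h)))


def power(a, n):
    """a^n for n >= 0 by repeated multiplication."""
    result = (0, Fraction(0), Fraction(0))
    for _ in range(n):
        result = mul(result, a)
    return result


g = (1, Fraction(1, 2), Fraction(0))
h = (2, Fraction(1, 3), Fraction(5))
a = (1, Fraction(1, 3), Fraction(1, 7))

print(commutator(g, h))  # third coordinate is m*y1 - n*x1 = 1/3 - 1
print(power(a, 5))       # (5, 5*alpha, 5*beta + 10*alpha)
```

Since every commutator lies in $\{0\} \times \{0\} \times \mathbb{R}$, and elements of that subgroup commute with all of $G$, the second commutator subgroup is trivial, in agreement with the 2-step nilpotency asserted in the example.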

We remark that the representation of a nilmanifold $X$ as a homogeneous space of a nilpotent
Lie group $G$ is not unique. If $X$ is a connected nilmanifold, it can be shown ([36]) that it admits
a representation of the form $X = G/\Gamma$ such that: $G_0$ is simply connected and $G = G_0\Gamma$. In
the sequel, whenever $X$ is connected, we will always assume that $G$ satisfies these two extra
assumptions.

3.2. Qualitative equidistribution results on nilmanifolds. If $G$ is a nilpotent group, then
a sequence $g\colon \mathbb{Z} \to G$ of the form $g(n) = a_1^{p_1(n)} a_2^{p_2(n)} \cdots a_k^{p_k(n)}$, where $a_i \in G$ and the $p_i$ are polynomials
taking integer values at the integers, is called a polynomial sequence in $G$. If the maximum of
the degrees of the polynomials $p_i$ is at most $d$ we say that the degree of $g(n)$ is at most $d$. A
polynomial sequence on the nilmanifold $X = G/\Gamma$ is a sequence of the form $(g(n)\Gamma)_{n\in\mathbb{N}}$ where
$g\colon \mathbb{Z} \to G$ is a polynomial sequence in $G$.

Theorem 3.1 (Leibman [36]). Suppose that $X = G/\Gamma$ is a connected nilmanifold and $g(n)$ is
a polynomial sequence in $G$. Let $Z = G/([G_0, G_0]\Gamma)$ and $\pi\colon X \to Z$ be the natural projection.

Then for every $x \in X$ the sequence $(g(n)x)_{n\in\mathbb{N}}$ is equidistributed in $X$ if and only if the
sequence $(g(n)\pi(x))_{n\in\mathbb{N}}$ is equidistributed in $Z$.

Note that $[G_0, G_0]$ is a normal subgroup of $G$ and so $G/[G_0, G_0]$ is a group.

3.3. Quantitative equidistribution results on nilmanifolds. 

3.3.1. The case of a connected group. We will later use a quantitative version of Theorem 3.1
that was recently obtained by Green and Tao in [28]. In order to state it we need to review
some notions that were introduced in [28].

Given a nilmanifold $X = G/\Gamma$, the horizontal torus is defined to be the compact Abelian
group $H = G/([G,G]\Gamma)$. If $X$ is connected, then $H$ is isomorphic to some finite dimensional
torus $\mathbb{T}^l$. By $\pi\colon X \to H$ we denote the natural projection map. A horizontal character
is a continuous homomorphism $\chi$ of $G$ that satisfies $\chi(g\gamma) = \chi(g)$ for every $\gamma \in \Gamma$. Since
every character annihilates $[G, G]$, every horizontal character factors through $H$, and so can
be thought of as a character of the horizontal torus. Since $H$ is identifiable with a finite
dimensional torus $\mathbb{T}^l$ (we assume that $X$ is connected), $\chi$ can also be thought of as a character
of $\mathbb{T}^l$, in which case there exists a unique $\kappa \in \mathbb{Z}^l$ such that $\chi(t) = \kappa \cdot t$, where $\cdot$ denotes the
inner product operation. We refer to $\kappa$ as the frequency of $\chi$ and to $\|\chi\| = |\kappa|$ as the frequency
magnitude of $\chi$.

Example 2. Let $X$ be as in Example 1. The map $\chi(m, x_1, x_2) = e(l x_1)$, where $l \in \mathbb{Z}$, is a
horizontal character of $G$ and the map $\phi(m, x_1, x_2) = x_1 \pmod 1$ induces an identification of
the horizontal torus with $\mathbb{T}$. Under this identification, $\chi$ is mapped to the character $\chi_l(t) =
e(lt)$ of $\mathbb{T}$.

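If $p\colon \mathbb{Z} \to \mathbb{R}$ is a polynomial sequence of degree $k$, then $p$ can be uniquely expressed in the
form $p(n) = \sum_{i=0}^{k} \binom{n}{i}\alpha_i$ where $\alpha_i \in \mathbb{R}$. For $N \in \mathbb{N}$ we define

(9)  $\|e(p(n))\|_{C^\infty[N]} = \max_{1\le i\le k}\big(N^i\,\|\alpha_i\|\big)$

where $\|x\| = d(x,\mathbb{Z})$. One also gets a similar definition for polynomials $p\colon \mathbb{Z} \to \mathbb{T}$.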
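To make definition (9) concrete, here is a small sketch that computes the coefficients $\alpha_i$ in the binomial basis by taking forward differences at $0$, and then evaluates $\max_{1\le i\le k} N^i\|\alpha_i\|$. The helper names are ours, and exact rationals are used in place of real coefficients.

```python
from fractions import Fraction
from math import comb


def binomial_coefficients(values):
    """Given p(0), ..., p(k), return [a_0, ..., a_k] with p(n) = sum_i C(n, i) * a_i.

    The a_i are the iterated forward differences of p at 0.
    """
    coeffs = []
    row = list(values)
    while row:
        coeffs.append(row[0])
        row = [row[j + 1] - row[j] for j in range(len(row) - 1)]
    return coeffs


def evaluate(coeffs, n):
    """Evaluate sum_i C(n, i) * a_i at a non-negative integer n."""
    return sum(comb(n, i) * a for i, a in enumerate(coeffs))


def dist_to_Z(x):
    """Distance from a rational x to the nearest integer."""
    frac = x - (x.numerator // x.denominator)  # fractional part in [0, 1)
    return min(frac, 1 - frac)


def smoothness_norm(values, N):
    """The quantity in (9): max over i >= 1 of N^i * ||a_i||."""
    coeffs = binomial_coefficients(values)
    return max(
        (N ** i * dist_to_Z(a) for i, a in enumerate(coeffs) if i >= 1),
        default=Fraction(0),
    )
```

For $p(n) = n^2/3$ one has $n^2 = 2\binom{n}{2} + \binom{n}{1}$, so $\alpha_1 = 1/3$ and $\alpha_2 = 2/3$, and the norm with $N = 10$ is $100/3$.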
Given $N \in \mathbb{N}$, a finite sequence $(g(n)\Gamma)_{1\le n\le N}$ is said to be $\delta$-equidistributed if

$\Big|\frac{1}{N}\sum_{n=1}^{N} F(g(n)\Gamma) - \int_X F\, dm_X\Big| \le \delta\,\|F\|_{\mathrm{Lip}}$

for every Lipschitz function $F\colon X \to \mathbb{C}$ where

$\|F\|_{\mathrm{Lip}} = \|F\|_\infty + \sup_{x,y\in X,\ x\ne y}\frac{|F(x)-F(y)|}{d_X(x,y)}$

for some appropriate metric $d_X$ on $X$. We can now state the quantitative equidistribution
result that we shall use. It can be easily derived from [28].

Theorem 3.2 (Green & Tao [28]). Let $X = G/\Gamma$ be a nilmanifold with $G$ connected and
simply connected and $d \in \mathbb{N}$.

Then there exists $C = C_{X,d} > 0$ with the following property: For every $N \in \mathbb{N}$ and $\delta$ small
enough, if $g\colon \mathbb{Z} \to G$ is a polynomial sequence of degree at most $d$ such that the finite sequence
$(g(n)\Gamma)_{1\le n\le N}$ is not $\delta$-equidistributed, then there exists a non-trivial horizontal character $\chi$,
with frequency magnitude $\|\chi\| \le \delta^{-C}$, such that

(10)  $\|\chi(g(n))\|_{C^\infty[N]} \le c_1\,\delta^{-C}$

for some absolute constant $c_1$, where $\chi$ is thought of as a character of the horizontal torus
$H = \mathbb{T}^l$ and $g(n)$ in (10) as a polynomial sequence in $\mathbb{T}^l$.



The metric $d_X$ is defined in [28] using a Malcev basis $\mathcal{X}$ of $X$, so the notion of equidistribution we get does
depend on the choice of the Malcev basis $\mathcal{X}$. The exact definition of $d_X$ will not be needed anywhere in our
article, so we omit it.

In our context, we assume that the Malcev basis and hence the metric on $X$ is fixed. So unlike the more
refined result stated in [28], for the result we state here there is no reason to impose restrictions on the Malcev
basis we use (or even refer to it).

Remarks. • We are not going to make use of the explicit form of the upper bounds on $\|\chi\|$ and
$\|\chi(g(n))\|_{C^\infty[N]}$; any upper bound that depends only on $\delta$, $X$, and $d$ will do just fine.

• Condition (10) implies that the finite sequence $(\pi(g(n)\Gamma))_{1\le n\le N_1}$ is not $(c_2\delta^{C})$-equidistributed
in $G/([G,G]\Gamma)$ for every $N_1 \le c_2\delta^{C} N$, for some absolute constant $c_2$.

Example 3. It is instructive to interpret the previous result in some special case. Let $X = \mathbb{T}$
(with the standard metric), and suppose that the polynomial sequence on $\mathbb{T}$ is given by $p(n) =
n^d\alpha + q(n)$ where $d \in \mathbb{N}$, $\alpha \in \mathbb{R}$, and $q \in \mathbb{R}[x]$ satisfies $\deg q \le d - 1$. In this case Theorem 3.2
reads as follows: There exists $C = C_d > 0$ such that for every $N \in \mathbb{N}$ and every $\delta$ small enough,
if the finite sequence $(n^d\alpha + q(n))_{1\le n\le N}$ is not $\delta$-equidistributed in $\mathbb{T}$, then $\|k\alpha\| \le c_1\delta^{-C}/N^d$
for some $k \in \mathbb{Z}$ with $|k| \le \delta^{-C}$ and some absolute constant $c_1 > 0$.

3.3.2. The general case. In this subsection we establish an extension of Theorem 3.2 to the
case where the group $G$ is not necessarily connected (but we always assume that $X = G/\Gamma$ is
connected and $G_0$ is simply connected).

Let $G$ be a group. A map $T\colon G \to G$ is said to be affine if $T(g) = b \cdot S(g)$ for some
homomorphism $S$ of $G$ and $b \in G$. The homomorphism $S$ is said to be unipotent if there exists
$n \in \mathbb{N}$ so that $(S - \mathrm{Id})^n = 0$. In this case we say that the affine transformation $T$ is a unipotent
affine transformation.

If $X = G/\Gamma$ is a connected nilmanifold, the affine torus of $X$ is defined to be the homogeneous
space $A = G/([G_0, G_0]\Gamma)$. The next lemma (whose statement and proof are reproduced from
[23]) explains our terminology (notice that if $H$ is the group $G/[G_0,G_0]$, then $H_0$ is Abelian).

Proposition 3.3 (F. & Kra [23]). Let $X = G/\Gamma$ be a connected nilmanifold and suppose that
the group $G_0$ is Abelian.

Then the nilrotations $T_a(x) = ax$, $a \in G$, defined on $X$ with the normalized Haar measure
$m_X$, are simultaneously isomorphic to a collection of unipotent affine transformations on some
finite dimensional torus with the normalized Haar measure. Furthermore, the conjugation map
can be taken to be continuous.

Proof. We start with a reduction. As we mentioned in Section 3.1, since $X$ is connected we can
assume that $G = G_0\Gamma$. We claim that under our additional assumption that $G_0$ is Abelian we
have that $\Gamma_0 = \Gamma \cap G_0$ is a normal subgroup of $G$. Let $\gamma_0 \in \Gamma_0$ and $g = g_0\gamma$, where $g_0 \in G_0$
and $\gamma \in \Gamma$. Since $G_0$ is normal in $G$, we have that $g^{-1}\gamma_0 g \in G_0$. Moreover,

$g^{-1}\gamma_0 g = \gamma^{-1}g_0^{-1}\gamma_0 g_0\gamma = \gamma^{-1}\gamma_0\gamma \in \Gamma$,

the last equality being valid since $G_0$ is Abelian. Hence, $g^{-1}\gamma_0 g \in \Gamma_0$ and $\Gamma_0$ is normal in $G$,
proving our claim. After substituting $G/\Gamma_0$ for $G$ and $\Gamma/\Gamma_0$ for $\Gamma$, we have $X = (G/\Gamma_0)/(\Gamma/\Gamma_0)$.
Therefore, we can assume that $G_0 \cap \Gamma = \{e\}$. Note that now $G_0$ is a connected compact Abelian
Lie group, and so is isomorphic to some finite dimensional torus $\mathbb{T}^d$.

Every $g \in G$ is uniquely representable in the form $g = g_0\gamma$, with $g_0 \in G_0$, $\gamma \in \Gamma$. The map
$\phi\colon X \to G_0$, given by $\phi(g\Gamma) = g_0$, is a well defined homeomorphism. Since $\phi(hg\Gamma) = h\phi(g\Gamma)$
for every $h \in G_0$, the measure $\phi(m_X)$ on $G_0$ is invariant under left translations. Thus $\phi(m_X)$ is
the normalized Haar measure on $G_0$. If $a = a_0\gamma$, $g = g_0\gamma'$ with $a_0, g_0 \in G_0$ and $\gamma, \gamma' \in \Gamma$, then
$ag\Gamma = a_0\gamma g_0\gamma^{-1}\Gamma$. Since $\gamma g_0\gamma^{-1} \in G_0$, we have that $\phi(ag\Gamma) = a_0\gamma g_0\gamma^{-1}$. Hence $\phi$ conjugates
$T_a$ to $T'_a\colon G_0 \to G_0$ defined by

$T'_a(g_0) = \phi T_a \phi^{-1}(g_0) = a_0\gamma g_0\gamma^{-1}$.

Since $G_0$ is Abelian this is an affine map; its linear part $g_0 \mapsto \gamma g_0\gamma^{-1}$ is unipotent since $G$ is
nilpotent. Letting $\psi\colon G_0 \to \mathbb{T}^d$ denote the isomorphism between $G_0$ and $\mathbb{T}^d$, we have that $T_a$
is isomorphic to the unipotent affine transformation $S = \psi T'_a \psi^{-1}$ acting on $\mathbb{T}^d$. □

Because of this lemma, we can identify the affine torus $A$ of a nilmanifold $X$ with a finite
dimensional torus $\mathbb{T}^l$ and think of a nilrotation acting on $A$ as a unipotent affine transformation
on $\mathbb{T}^l$.

Example 4. Let $X$ be as in Example 1. We have $X \simeq (\mathbb{Z} \times \mathbb{T}^2)/(\mathbb{Z} \times \{0\})$, so we can assume
that we have equality. If $a = (m, \alpha_1, \alpha_2) \in \mathbb{Z} \times \mathbb{T}^2$, then the map $\phi\colon \mathbb{Z} \times \mathbb{T}^2 \to \mathbb{T}^2$, defined by
$\phi(k, t_1, t_2) = (t_1, t_2) \pmod 1$, factors through $X$, and conjugates the nilrotation $T_a(x) = ax$ to
the unipotent affine transformation $S\colon \mathbb{T}^2 \to \mathbb{T}^2$ defined by

$S(t_1, t_2) = (t_1 + \alpha_1,\ t_2 + m t_1 + \alpha_2)$.

A quasi-character of a nilmanifold $X = G/\Gamma$ is a function $\psi\colon G \to \mathbb{C}$ that is a continuous
homomorphism of $G_0$ and satisfies $\psi(g\gamma) = \psi(g)$ for every $\gamma \in \Gamma$. Every quasi-character
annihilates $[G_0, G_0]$, so it factors through the affine torus $A$ of $X$. Under the identification of
Proposition 3.3 we have that $A \simeq \mathbb{T}^l$ and every quasi-character of $X$ is mapped to a character
of $\mathbb{T}^l$. Therefore, thinking of $\psi$ as a character of $\mathbb{T}^l$ we have $\psi(t) = \kappa \cdot t$ for some $\kappa \in \mathbb{Z}^l$, where
$\cdot$ denotes the inner product operation. We refer to $\kappa$ as the frequency of $\psi$ and to $\|\psi\| = |\kappa|$ as
the frequency magnitude of $\psi$.

Example 5. Let $X$ be as in Example 1. The map $\psi(m, x_1, x_2) = e(l_1x_1 + l_2x_2)$, where $l_1, l_2 \in \mathbb{Z}$,
is a quasi-character of $X$. Notice that $\psi$ is not a homomorphism of $G$ and so it is not a
character of $X$. The map $\phi(m, x_1, x_2) = (x_1, x_2) \pmod 1$ induces an identification of the affine
torus (in this case $A = X$) with $\mathbb{T}^2$. Under this identification, $\psi$ is mapped to the character
$\psi_l(t_1, t_2) = e(l_1t_1 + l_2t_2)$ of $\mathbb{T}^2$.

We are now ready to state the advertised extension of Theorem 3.2.

Theorem 3.4 (Corollary of Theorem 3.2). Let $X = G/\Gamma$ be a connected nilmanifold (we
always assume that $G_0$ is simply connected) and $d \in \mathbb{N}$.

Then there exists $C = C_{X,d} > 0$ with the following property: For every $N \in \mathbb{N}$ and $\delta$ small
enough, if $g\colon \mathbb{Z} \to G$ is a polynomial sequence of degree at most $d$ such that the finite sequence
$(g(n)\Gamma)_{1\le n\le N}$ is not $\delta$-equidistributed, then there exists a non-trivial quasi-character $\psi$ with
frequency magnitude $\|\psi\| \le \delta^{-C}$ such that

(11)  $\|\psi(g(n))\|_{C^\infty[N]} \le c_1\,\delta^{-C}$

for some absolute constant $c_1$, where we think of $\psi$ as a character of some finite dimensional
torus $\mathbb{T}^l$ (the affine torus) and $g(n)$ as a polynomial sequence of unipotent affine transformations
on $\mathbb{T}^l$.

Remark. We have $\psi(g(n)) = e(p(n))$ for some $p \in \mathbb{R}[x]$ and so $\|\psi(g(n))\|_{C^\infty[N]}$ is well defined.

We first make some observations that will help us deduce Theorem 3.4 from Theorem 3.2. As
we remarked in Section 3.1, if $X = G/\Gamma$ is a connected nilmanifold we can assume that every
$g \in G$ is representable in the form $g_0\gamma$, where $g_0 \in G_0$ and $\gamma \in \Gamma$. Therefore, $X = (G_0\Gamma)/\Gamma$ can
be identified with the nilmanifold $G_0/(G_0 \cap \Gamma)$. If $a \in G$ we have $a = a_0\gamma$ for some $a_0 \in G_0$ and
$\gamma \in \Gamma$. Since $G_0$ is a normal subgroup of $G$ we have that $a^n = a_n\gamma^n$ for some $a_n \in G_0$. Using
this, one easily verifies that any degree $d$ polynomial sequence $g(n)$ in $G$ factors as follows:
$g(n) = g_0(n)\gamma(n)$ where $g_0(n) \in G_0$ for $n \in \mathbb{N}$ and $\gamma(n)$ is a degree $d$ polynomial sequence in $\Gamma$.
By Proposition 3.9 in [35] (for a more direct proof see Proposition 4.1 in [9]) we get that $g_0(n)$
is also a polynomial sequence in $G_0$. Moreover, if $G$ is $k$-step nilpotent, a close examination of
the proof of Proposition 4.1 in [9] reveals that the degree of $g_0(n)$ is at most $dk$.

Example 6. Let $X$ be the nilmanifold of Example 1. We have $G_0 = \{0\} \times \mathbb{R}^2$ and the map
$\phi\colon \mathbb{Z} \times \mathbb{T}^2 \to \mathbb{T}^2$, defined by $\phi(k, t_1, t_2) = (0, t_1, t_2)$, induces an identification between $X$ and
the nilmanifold $G_0/(G_0 \cap \Gamma) \simeq \mathbb{T}^2$. For $a = (2, \alpha, \alpha)$ the polynomial sequence $g(n) = a^n$ in $G$
factors as

$g(n) = (2n,\ n\alpha,\ n^2\alpha) = a_0^{n} \cdot b_0^{n^2} \cdot \gamma^{n}$

where $a_0 = (0, \alpha, 0)$, $b_0 = (0, 0, \alpha) \in G_0$, and $\gamma = (2, 0, 0) \in \Gamma$. In this case we have that
$g_0(n) = a_0^{n} \cdot b_0^{n^2}$ is a degree 2 polynomial sequence in $G_0$.
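The factorization in Example 6 is a finite algebraic identity in the group of Example 1, so it can be checked mechanically. The sketch below uses our own helper names and a rational stand-in for $\alpha$ (the identity does not depend on irrationality). It compares $a^n$ with $a_0^n b_0^{n^2}\gamma^n$ and with the closed form $(2n, n\alpha, n^2\alpha)$.

```python
from fractions import Fraction


def mul(g, h):
    """Group law of Example 1: (m,x1,x2)*(n,y1,y2) = (m+n, x1+y1, x2+y2+m*y1)."""
    m, x1, x2 = g
    n, y1, y2 = h
    return (m + n, x1 + y1, x2 + y2 + m * y1)


def power(a, n):
    """a^n for n >= 0, by repeated multiplication."""
    result = (0, Fraction(0), Fraction(0))
    for _ in range(n):
        result = mul(result, a)
    return result


def factorization_holds(alpha, n):
    """Check a^n == a0^n * b0^(n^2) * gamma^n == (2n, n*alpha, n^2*alpha)."""
    zero = Fraction(0)
    a = (2, alpha, alpha)
    a0 = (0, alpha, zero)      # lies in G_0 = {0} x R^2
    b0 = (0, zero, alpha)      # lies in G_0
    gamma = (2, zero, zero)    # lies in the lattice Z^3
    lhs = power(a, n)
    g0 = mul(power(a0, n), power(b0, n * n))   # the G_0-part g_0(n)
    rhs = mul(g0, power(gamma, n))
    return lhs == rhs == (2 * n, n * alpha, n * n * alpha)
```

The exponent $n^2$ on $b_0$ is what makes $g_0(n)$ a degree 2 sequence even though $g(n) = a^n$ has degree 1, in line with the bound $dk$ with $d = 1$, $k = 2$.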

Proof of Theorem 3.4. Let $C = C_{G_0/(G_0\cap\Gamma),\,kd}$ be the positive number defined in Theorem 3.2.
Suppose that $(g(n)\Gamma)_{1\le n\le N}$ is not $\delta$-equidistributed in $X = G/\Gamma$ for some small enough $\delta$.
As discussed before, we have $g(n)\Gamma = g_0(n)\Gamma$ where $g_0(n)$ is a polynomial sequence in $G_0$
of degree at most $kd$. Since $X$ can be identified with $G_0/(G_0 \cap \Gamma)$, it follows that the finite
sequence $(g_0(n)(G_0 \cap \Gamma))_{1\le n\le N}$ is not $\delta$-equidistributed in $G_0/(G_0 \cap \Gamma)$. Since $G_0$ is connected
and simply connected, and the polynomial sequence $g_0(n)$ is defined in $G_0$ and has degree at
most $kd$, Theorem 3.2 applies. We get that there exists a non-trivial horizontal character $\chi_0$
of $G_0/(G_0 \cap \Gamma)$ such that $\|\chi_0\| \le \delta^{-C}$ and

$\|\chi_0(g_0(n))\|_{C^\infty[N]} \le c_1\,\delta^{-C}$.

We can lift $\chi_0$ to a quasi-character of $X$ as follows: Consider the discrete group $G/G_0$.
Since $G = G_0\Gamma$ we have $G/G_0 = \{\gamma G_0 : \gamma \in \Gamma\}$. Let $\tilde\Gamma$ be a subset of $\Gamma$ so that the map
$\gamma \to \gamma G_0$, from $\tilde\Gamma$ to $G/G_0$, is bijective. Then every element $h \in G$ has a unique representation
$h = h_0\tilde\gamma$ with $h_0 \in G_0$ and $\tilde\gamma \in \tilde\Gamma$. We define the map $\psi\colon G \to \mathbb{C}$ by $\psi(h) = \chi_0(h_0)$. Since
$\chi_0(g_0\gamma_0) = \chi_0(g_0)$ for every $g_0 \in G_0$ and $\gamma_0 \in G_0 \cap \Gamma$, it follows that $\psi$ agrees with $\chi_0$ on $G_0$.
Furthermore, writing $\gamma \in \Gamma$ as $\gamma = \gamma_0\tilde\gamma$ with $\gamma_0 \in G_0 \cap \Gamma$ and $\tilde\gamma \in \tilde\Gamma$, and using again that
$\chi_0(g_0\gamma_0) = \chi_0(g_0)$ for $g_0 \in G_0$, one gets that $\psi(g_0\gamma) = \psi(g_0\gamma_0\tilde\gamma) = \chi_0(g_0\gamma_0) = \chi_0(g_0) = \psi(g_0)$
for every $g_0 \in G_0$ and $\gamma \in \Gamma$. Since every $g \in G$ can be written as $g = g_0\gamma$ for some $g_0 \in G_0$
and $\gamma \in \Gamma$, we conclude that $\psi(g\gamma) = \psi(g_0) = \psi(g)$ for every $g \in G$ and $\gamma \in \Gamma$. We have
established that $\psi$ is a quasi-character of $X$ that extends the character $\chi_0$.

Since $\psi(g(n)) = \chi_0(g_0(n))$, we get that $\|\psi\| = \|\chi_0\| \le \delta^{-C}$ and also that equation (11) is
satisfied. Lastly, $\psi$ factors through the affine torus $A$ and by Proposition 3.3 $A$ can be identified
with a finite dimensional torus $\mathbb{T}^l$. Under this identification $\psi$ is mapped to a character of $\mathbb{T}^l$
and the polynomial sequence $g(n)$ on the affine torus $A$ is mapped to a polynomial sequence of
unipotent affine transformations on $\mathbb{T}^l$. This completes the proof. □

4. A MODEL MULTIPLE RECURRENCE RESULT 

We are going to prove Theorem B'. For convenience, we repeat its statement:

Theorem B'. Suppose that for every $m \in \mathbb{N}$ the set $S \subset \mathbb{Z}$ contains arithmetic progressions
of the form $\{c_m + mn : 1 \le n \le N_m\}$ where $c_m, N_m$ are integers and $N_m \to \infty$ as $m \to \infty$.
Then $S$ is a set of multiple recurrence.

Part of the proof of Theorem B' (the proof of Proposition 4.5 and the final step of the proof of
Theorem B' in Section 4.2) carries over almost verbatim to the more complicated Hardy field setup
(proof of Theorem A'). In order to better illustrate the ideas we chose to give the argument in
this simpler setup. It splits in two parts: we first reduce things to nilsystems and then verify a
multiple recurrence property for nilsystems.

4.1. Reduction to nilsystems. We shall study the multiple ergodic averages that are naturally
associated to the multiple recurrence problem of Theorem B'. We shall show that the
nilfactor is characteristic for $L^2$-convergence of these averages. Using Theorem 2.3 it is then
not hard to see that in order to establish Theorem B' it suffices to verify a multiple recurrence
property for nilsystems.

As is often the case when proving such reduction results, a key tool is a Hilbert space
version of a classical elementary estimate of van der Corput. Its proof is identical with the
proof of this classical estimate (e.g. Lemma 3.1 in [34]).



Lemma 4.1. Let $v_1, \ldots, v_N$ be vectors of a Hilbert space with $\|v_i\| \le 1$ for $i = 1, \ldots, N$.
Then for every integer $H$ between 1 and $N$ we have

n=l h=l n=l 
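For orientation, the classical scalar estimate of van der Corput can be tested numerically. The sketch below checks one standard form of it, namely $H^2\big|\sum_{n=1}^N u_n\big|^2 \le H(N+H-1)\sum_{n=1}^N |u_n|^2 + 2(N+H-1)\sum_{h=1}^{H-1}(H-h)\,\mathrm{Re}\sum_{n=1}^{N-h} u_n\overline{u_{n+h}}$ for complex numbers $u_1,\ldots,u_N$ and $1 \le H \le N$. This is an assumption about which classical form is meant; the normalization and error terms of Lemma 4.1 may differ, and the function names are ours.

```python
import cmath
import math
import random


def random_unit_numbers(N, seed):
    """N pseudo-random complex numbers of modulus 1 (illustrative test data)."""
    rng = random.Random(seed)
    return [cmath.exp(2j * math.pi * rng.random()) for _ in range(N)]


def vdc_sides(u, H):
    """Return (lhs, rhs) of the scalar van der Corput inequality stated above."""
    N = len(u)
    lhs = H * H * abs(sum(u)) ** 2
    energy = sum(abs(x) ** 2 for x in u)
    corr = 0.0
    for h in range(1, H):
        # Correlation of the sequence with its shift by h (0-based indices).
        inner = sum(u[n] * u[n + h].conjugate() for n in range(N - h))
        corr += (H - h) * inner.real
    rhs = H * (N + H - 1) * energy + 2 * (N + H - 1) * corr
    return lhs, rhs
```

The inequality follows from writing $H\sum_n u_n$ as a sum over $N+H-1$ windows of length $H$ and applying Cauchy-Schwarz. Small correlations $\sum_n u_n\overline{u_{n+h}}$ therefore force the average of the $u_n$ to be small, which is how such an estimate is used in the proofs that follow.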

Lemma 4.2. Let $(X, \mathcal{B}, \mu, T)$ be a system and $f_1, f_2 \in L^\infty(\mu)$ satisfy $f_i \perp \mathcal{Z}$ for $i = 1$ or $2$,
where $\mathcal{Z}$ is the nilfactor of the system. Let $c_m, N_m$ be integers and $N_m \to \infty$ as $m \to \infty$.
Then the averages

(12)  $\frac{1}{M}\sum_{m=1}^{M}\frac{1}{N_m}\sum_{n=1}^{N_m} T^{c_m+mn}f_1 \cdot T^{2(c_m+mn)}f_2$

converge to 0 in $L^2(\mu)$ as $M \to \infty$.

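Proof. We can assume that $f_2 \perp \mathcal{Z}$; the proof is similar in the other case. Furthermore, we can
assume that $\|f_1\|_{L^\infty}, \|f_2\|_{L^\infty} \le 1$. Let $A_M$ denote the averages (12). We have

$\|A_M\|_{L^2(\mu)}^4 \le \frac{1}{M}\sum_{m=1}^{M}\Big\|\frac{1}{N_m}\sum_{n=1}^{N_m} T^{c_m+mn}f_1 \cdot T^{2(c_m+mn)}f_2\Big\|_{L^2(\mu)}^4.$

Using Lemma 4.1 and the Cauchy-Schwarz inequality we get that for every $H_{m,1}$ such that
$H_{m,1} \prec N_m$ (meaning $H_{m,1}/N_m \to 0$ as $m \to \infty$) the last expression is bounded by

$\frac{4}{M}\sum_{m=1}^{M}\frac{1}{H_{m,1}}\sum_{h_1=1}^{H_{m,1}}\Big|\frac{1}{N_m}\sum_{n=1}^{N_m}\int T^{c_m+mn}f_1 \cdot T^{2(c_m+mn)}f_2 \cdot T^{c_m+mn+mh_1}\bar f_1 \cdot T^{2(c_m+mn+mh_1)}\bar f_2\ d\mu\Big|^2 + o_{M,H_{m,1}}(1).$

Factoring out the measure preserving transformation $T^{c_m+mn}$ and using the Cauchy-Schwarz
inequality we see that the last expression is bounded by

$\frac{4}{M}\sum_{m=1}^{M}\frac{1}{H_{m,1}}\sum_{h_1=1}^{H_{m,1}}\big\|f_1 \cdot T^{mh_1}\bar f_1\big\|_{L^2(\mu)}^2\int\Big|\frac{1}{N_m}\sum_{n=1}^{N_m} T^{c_m+mn}\big(f_2 \cdot T^{2mh_1}\bar f_2\big)\Big|^2 d\mu + o_{M,H_{m,1}}(1).$

Factoring out $T^{c_m}$ and using that $\|f_1\|_{L^\infty} \le 1$ we see that the last expression is bounded by

$\frac{4}{M}\sum_{m=1}^{M}\frac{1}{H_{m,1}}\sum_{h_1=1}^{H_{m,1}}\Big\|\frac{1}{N_m}\sum_{n=1}^{N_m} T^{mn}f_2 \cdot T^{mn+2mh_1}\bar f_2\Big\|_{L^2(\mu)}^2 + o_{M,H_{m,1}}(1).$

Using Lemma 4.1 again, factoring out $T^{mn}$, and noticing that the resulting expression no longer
depends on $n$, we get that for every $H_{m,1}, H_{m,2} \prec N_m$ this last expression is bounded by

$\frac{8}{M}\sum_{m=1}^{M}\frac{1}{H_{m,1}}\sum_{h_1=1}^{H_{m,1}}\frac{1}{H_{m,2}}\sum_{h_2=1}^{H_{m,2}}\Big|\int f_2 \cdot T^{2mh_1}\bar f_2 \cdot T^{mh_2}\bar f_2 \cdot T^{2mh_1+mh_2}f_2\ d\mu\Big| + o_{M,H_{m,1},H_{m,2}}(1).$

We can choose $H_{m,1}, H_{m,2}$ to be $\prec N_m$, increase to $\infty$ as $m \to \infty$, and furthermore such that
the subsets of $\mathbb{N}^3$ defined by

$\Phi_M = \{(m, h_1, h_2) \in \mathbb{N}^3 : 1 \le m \le M,\ 1 \le h_1 \le H_{m,1},\ 1 \le h_2 \le H_{m,2}\}$

for $M \in \mathbb{N}$, form a Følner sequence. Since $f_2 \perp \mathcal{Z}$, by Corollary 2.2 we have that

$\frac{1}{|\Phi_M|}\sum_{(m,h_1,h_2)\in\Phi_M}\Big|\int f_2 \cdot T^{2mh_1}\bar f_2 \cdot T^{mh_2}\bar f_2 \cdot T^{2mh_1+mh_2}f_2\ d\mu\Big|$

converges to zero as $M \to \infty$. This shows that the averages $A_M$ converge to zero in $L^2(\mu)$ as
$M \to \infty$ and finishes the proof. □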

The proof of the next result is very similar to the proof of Lemma 4.2, only notationally
more complicated, and so we omit it.

Proposition 4.3. Let $(X, \mathcal{B}, \mu, T)$ be a system and $f_1, \ldots, f_\ell \in L^\infty(\mu)$ satisfy $f_i \perp \mathcal{Z}$ for some
$i = 1, \ldots, \ell$, where $\mathcal{Z}$ is the nilfactor of the system. Let $c_m, N_m$ be integers with $N_m \to \infty$.
Then the averages

$\frac{1}{M}\sum_{m=1}^{M}\frac{1}{N_m}\sum_{n=1}^{N_m} T^{c_m+mn}f_1 \cdot T^{2(c_m+mn)}f_2 \cdot \ldots \cdot T^{\ell(c_m+mn)}f_\ell$

converge to 0 in $L^2(\mu)$ as $M \to \infty$.

4.1.1. Dealing with nilsystems, a key equidistribution result.

Lemma 4.4. Let $B \subset \mathbb{R}\setminus\mathbb{Q}$ and $K \subset \mathbb{N}$ be finite sets, and $(\varepsilon_m)_{m\in\mathbb{N}}$ be a sequence of positive
real numbers such that $\lim_{m\to\infty}\varepsilon_m = 0$.

Then the set $S = \{m \in \mathbb{N} : \|m^k\alpha\| \le \varepsilon_m$ for some $\alpha \in B$ and $k \in K\}$ has density 0.

Proof. If $\alpha$ is irrational, then the sequence $(m^k\alpha)_{m\in\mathbb{N}}$ is equidistributed in $\mathbb{T}$. Hence, for every
$\varepsilon > 0$ we have $d(\{m \in \mathbb{N} : \|m^k\alpha\| \le \varepsilon\}) = 2\varepsilon$. It follows that for fixed $k \in \mathbb{N}$ and $\alpha$ irrational,
the set $S_{k,\alpha} = \{m \in \mathbb{N} : \|m^k\alpha\| \le \varepsilon_m\}$ has zero density. Since $S$ is contained in a finite union
of sets of the form $S_{k,\alpha}$, it also has zero density. □
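The two facts used in the proof of Lemma 4.4 can be observed numerically. The following sketch (our own helper names, floating point arithmetic, and $\alpha = \sqrt{2}$ as an illustrative irrational) estimates the proportion of $m \le M$ with $\|m^k\alpha\| \le \varepsilon_m$, first for a constant $\varepsilon$, where the proportion should be close to $2\varepsilon$, and then for a sequence $\varepsilon_m \to 0$, where it should be small.

```python
import math


def dist_to_nearest_integer(x):
    """||x||: distance from the real number x to the nearest integer."""
    return abs(x - round(x))


def proportion_close(alpha, k, eps, M):
    """Proportion of 1 <= m <= M with ||m^k * alpha|| <= eps(m).

    eps is a function m -> eps_m; floating point limits how large m^k may be.
    """
    hits = sum(
        1
        for m in range(1, M + 1)
        if dist_to_nearest_integer(m ** k * alpha) <= eps(m)
    )
    return hits / M


def constant_eps(value):
    """The constant sequence eps_m = value."""
    return lambda m: value


def decaying_eps(m):
    """An illustrative sequence tending to zero: eps_m = m^(-1/2)."""
    return m ** -0.5
```

For instance, `proportion_close(math.sqrt(2), 1, constant_eps(0.1), 100000)` approximates the density $2\varepsilon = 0.2$, while replacing `constant_eps(0.1)` by `decaying_eps` gives a much smaller proportion. This is only a finite experiment and proves nothing about densities.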

Remember that given a connected nilmanifold $X = G/\Gamma$, an element $a \in G$ is an ergodic
nilrotation if the sequence $(a^n\Gamma)_{n\in\mathbb{N}}$ is equidistributed in $X$.

Proposition 4.5. Let $X = G/\Gamma$ be a connected nilmanifold and $a \in G$ be an ergodic nilrotation.
Let $c_m$ be positive integers and $(N_m)_{m\in\mathbb{N}}$ be a sequence of integers with $N_m \to \infty$.
Then for every $F \in C(X)$ we have

$\lim_{M\to\infty}\frac{1}{M}\sum_{m=1}^{M}\Big(\frac{1}{N_m}\sum_{n=1}^{N_m} F(a^{c_m+mn}\Gamma)\Big) = \int F\, dm_X.$

Proof. It suffices to show that for every $\delta > 0$, for a set of $m \in \mathbb{N}$ of density 1, the finite
sequence $(a^{c_m+mn}\Gamma)_{1\le n\le N_m}$ is $\delta$-equidistributed in $X$.

Let $\delta > 0$ be small enough, and suppose that the finite sequence $(a^{c_m+mn}\Gamma)_{1\le n\le N_m}$ is not
$\delta$-equidistributed for some $m \in \mathbb{N}$. By Theorem 3.4 there exist a constant $M = M_{\delta,X}$ ($M$ does
not depend on $m$, $N_m$, or $c_m$) and a quasi-character $\psi$ with $\|\psi\| \le M$ such that

(13)  $\|\psi(a^{c_m+mn})\|_{C^\infty[N_m]} \le M.$

As explained in Section 3.3.2, the affine torus $A$ of $X$ can be identified with a finite dimensional
torus $\mathbb{T}^l$. After making this identification, we have $\psi(t) = \kappa \cdot t$ for some non-zero $\kappa \in \mathbb{Z}^l$, and
the nilrotation $a$ induces a $d$-step unipotent affine transformation $T_a\colon \mathbb{T}^l \to \mathbb{T}^l$. Let $\pi(a) =
(\alpha_1\mathbb{Z}, \ldots, \alpha_s\mathbb{Z})$, where $\alpha_i \in \mathbb{R}$, be the projection of $a$ on the horizontal torus $\mathbb{T}^s$. Since $\pi(a)$ is
an ergodic rotation the real numbers $1, \alpha_1, \ldots, \alpha_s$ are rationally independent. The coordinates
of $T_a^n e$, where $e$ is the identity element of $\mathbb{T}^l$, are polynomials of $n$, and so $\kappa \cdot T_a^n e$ is a polynomial
of $n$. Moreover, it is not hard to see that the leading term of the polynomial $\kappa \cdot T_a^n e$ has the
form $\alpha n^k$, where $k \le d$ and

(14)  $\alpha = \frac{1}{k!}\sum_{i=1}^{s} r_i\alpha_i$, $r_i \in \mathbb{Z}$ not all of them zero with $|r_i| \le c_1 \cdot M$

for some constant $c_1$ that depends only on $a$. From this and the definition of $\|\cdot\|_{C^\infty[N]}$ (see (9))
it follows that

$\|\psi(a^{c_m+mn})\|_{C^\infty[N_m]} = \|\psi(T_a^{c_m+mn}e)\|_{C^\infty[N_m]} \ge N_m^k\,\|k!\,m^k\alpha\|.$

Combining this with (13) we get that

(15)  $\|k!\,m^k\alpha\| \le \frac{M}{N_m^k}.$

Since $k \le d$, and by (14) we have only finitely many options for (the irrational) $\alpha$, Lemma 4.4
applies and shows that the set of $m \in \mathbb{N}$ that satisfy equation (15) has zero density. This shows
that the finite sequence $(a^{c_m+mn}\Gamma)_{1\le n\le N_m}$ is $\delta$-equidistributed in $X$ for a set of $m \in \mathbb{N}$ with
density 1, completing the proof. □

4.2. Conclusion of the argument. We first use Proposition 4.3 to carry out a reduction to
nilsystems step, and then use the equidistribution result of Proposition 4.5 to verify a multiple
recurrence result for nilsystems. This will enable us to conclude the proof of Theorem B'. We first
need two simple lemmas.

Lemma 4.6. Suppose that $c_m, N_m$ are integers and $N_m \to \infty$ as $m \to \infty$. Let

$S = \{c_m + mn : 1 \le n \le N_m,\ m \in \mathbb{N}\}.$

Then for every $r, m \in \mathbb{N}$ there exist $c_{r,m}, N_{r,m} \in \mathbb{N}$, with $N_{r,m} \to \infty$ as $m \to \infty$, and such
that

$S_r = \{r(c_{r,m} + mn) : 1 \le n \le N_{r,m},\ m \in \mathbb{N}\} \subset S.$

Proof. Suppose that $(r, m) = d$, then $m = dm_1$ for some $m_1 \in \mathbb{N}$ such that $(m_1, r) = 1$. Choose
$1 \le k \le r$ such that $km_1 \equiv -c_{m_1} \pmod r$. Then $c_{m_1} + m_1(drn + k) = r(c_{r,m} + mn)$ for some
$c_{r,m} \in \mathbb{N}$, and so $r(c_{r,m} + mn) \in S$ for $1 \le n \le K_m$, where $K_m = (N_{m_1} - r)/(dr)$. The result
follows. □
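The arithmetic behind the proof of Lemma 4.6 can be sketched in the special case $(m, r) = 1$, so that $d = 1$ and $m_1 = m$ in the notation of the proof. Given a progression $\{c + mj : 1 \le j \le N\}$, the code below (the function name and return convention are ours) finds $k \in \{1,\ldots,r\}$ with $km \equiv -c \pmod r$ and returns the resulting sub-progression of multiples of $r$.

```python
from math import gcd


def multiples_inside_progression(c, m, r, length):
    """Coprime case of Lemma 4.6: find multiples of r inside {c + m*j : 1 <= j <= length}.

    Assumes gcd(m, r) == 1. Returns (c_rm, k, K) such that
    c + m*(r*n + k) == r*(c_rm + m*n) and 1 <= r*n + k <= length for 1 <= n <= K.
    """
    if gcd(m, r) != 1:
        raise ValueError("this sketch only treats the case gcd(m, r) == 1")
    m_inverse = pow(m, -1, r) if r > 1 else 0   # modular inverse (Python 3.8+)
    k = (-c * m_inverse) % r
    if k == 0:
        k = r                                    # keep k in the range 1..r
    c_rm, remainder = divmod(c + m * k, r)
    assert remainder == 0                        # k was chosen so that r | c + m*k
    K = (length - r) // r                        # then r*K + k <= length since k <= r
    return c_rm, k, K
```

For example, with $c = 5$, $m = 7$, $r = 4$ and $N = 50$ one gets $k = 1$ and $c_{r,m} = 3$: the terms $5 + 7(4n + 1) = 4(3 + 7n)$ lie in the original progression for $1 \le n \le 11$.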
Lemma 4.7. Let $X = G/\Gamma$ be a connected nilmanifold and $a \in G$ be an ergodic nilrotation.

Then there exists a connected sub-nilmanifold $Z$ of $X^\ell$ such that for a.e. $g \in G$ the element
$b_g = (g^{-1}ag, g^{-1}a^2g, \ldots, g^{-1}a^\ell g)$ acts ergodically on $Z$.

Remark. The independence of $Z$ on the generic $g \in G$ will not be needed; only that $Z$ is
connected will be used.

Proof. This is an easy consequence of a limit formula that appears in Theorem 2.2 of [49] (the
details of the deduction appear in Corollary 2.10 of [2JJ). □

We are now ready to give the proof of Theorem B'.

Proof of Theorem B'. Fix $\ell \in \mathbb{N}$. For $r \in \mathbb{N}$ let $S_r$ be the subset of $S$ defined in Lemma 4.6.
It suffices to show that for every system $(Y, \mathcal{D}, \mu, T)$, and $f \in L^\infty(\mu)$ non-negative and not a.e.
zero, there exists an $r \in \mathbb{N}$ such that

(16)  $\liminf_{M\to\infty}\frac{1}{M}\sum_{m=1}^{M}\Big(\frac{1}{N_{r,m}}\sum_{n=1}^{N_{r,m}}\int f \cdot T^{r(c_{r,m}+mn)}f \cdot \ldots \cdot T^{\ell r(c_{r,m}+mn)}f\ d\mu\Big) > 0.$

Suppose first that the system is ergodic. Using a slight modification of Proposition 4.3 we
see that the nilfactor $\mathcal{Z}$ is characteristic for the multiple ergodic averages appearing in (16)
(remember $\mathcal{Z}$ contains all factors $\mathcal{Z}_\ell$ for $\ell \in \mathbb{N}$). Therefore, it suffices to verify (16) with $\mathbb{E}(f|\mathcal{Z})$
in place of $f$. As a consequence, by Theorem 2.3 we can assume that our system is an inverse
limit of nilsystems.

In this case, for given $\varepsilon > 0$ (to be specified later) there exists a finite step nilfactor $\mathcal{N}$ such
that $h = \mathbb{E}(f|\mathcal{N})$ satisfies $\|f - h\|_{L^2(\mu)} \le \varepsilon$. It is easy to verify that

(17)  $\Big|\int f \cdot T^{n}f \cdot \ldots \cdot T^{\ell n}f\ d\mu - \int h \cdot T^{n}h \cdot \ldots \cdot T^{\ell n}h\ d\mu\Big| \le c_1\varepsilon$

for every $n \in \mathbb{N}$, where $c_1$ is some constant that depends only on $f$. Using an
appropriate conjugation we can assume that $T = T_a$ is an ergodic nilrotation acting on a
nilmanifold $X$, $\mu = m_X$, and $h$ is a non-negative, bounded measurable function on $X$, with
$\int h\, dm_X = \int f\, d\mu$. We are going to work with these extra assumptions henceforth.

Let $X_0$ be the connected component of the nilmanifold $X$. It is easy to see that there exists
an $r_0 \in \mathbb{N}$ such that the nilmanifold $X$ is the disjoint union of the connected sub-nilmanifolds
$X_i = a^iX_0$, $i = 0, \ldots, r_0 - 1$, and $a^{r_0}$ acts ergodically on each $X_i$.

For $r = r_0$, we shall see that (16) follows easily from Szemerédi's theorem and the identity

(18)  $\lim_{M\to\infty}\frac{1}{M}\sum_{m=1}^{M}\Big(\frac{1}{N_{r_0,m}}\sum_{n=1}^{N_{r_0,m}}\int h(x) \cdot h(a^{r_0(c_{r_0,m}+mn)}x) \cdot \ldots \cdot h(a^{\ell r_0(c_{r_0,m}+mn)}x)\ dm_X\Big)
= \lim_{N\to\infty}\frac{1}{N}\sum_{n=1}^{N}\int h(x) \cdot h(a^{r_0n}x) \cdot \ldots \cdot h(a^{\ell r_0n}x)\ dm_X.$

We first verify (18). An easy approximation argument shows that it suffices to verify (18)
for every $h \in C(X)$. Our plan is to use Proposition 4.5 to establish a stronger pointwise result.
We can assume that $x = g\Gamma$ is an element of $X_0$; a similar argument applies if $x \in a^iX_0$ for
$i = 1, \ldots, r_0 - 1$. An easy computation shows that the limit

(19)  $\lim_{M\to\infty}\frac{1}{M}\sum_{m=1}^{M}\Big(\frac{1}{N_{r_0,m}}\sum_{n=1}^{N_{r_0,m}} h(a^{r_0(c_{r_0,m}+mn)}x) \cdot \ldots \cdot h(a^{\ell r_0(c_{r_0,m}+mn)}x)\Big)$

is equal to the limit

(20)  $\lim_{M\to\infty}\frac{1}{M}\sum_{m=1}^{M}\Big(\frac{1}{N_{r_0,m}}\sum_{n=1}^{N_{r_0,m}}\tilde h\big(b_g^{c_{r_0,m}+mn}\tilde\Gamma\big)\Big)$

where

$b_g = (g^{-1}a^{r_0}g,\ g^{-1}a^{2r_0}g,\ \ldots,\ g^{-1}a^{\ell r_0}g),$

$\tilde h(x_1, \ldots, x_\ell) = h(gx_1) \cdot \ldots \cdot h(gx_\ell)$ ($\in C(X_0^\ell)$), and $\tilde\Gamma = \Gamma^\ell$. Since $a^{r_0}$ acts ergodically on the
connected nilmanifold $X_0$, by Lemma 4.7 there exists a connected sub-nilmanifold $Z$ of $X_0^\ell$ such
that for a.e. $g \in G$ the element $b_g$ acts ergodically on $Z$. For those values of $g$, Proposition 4.5
gives that the limit (20) is equal to $\int \tilde h\, dm_Z$. Since $b_g$ acts ergodically on $Z$, this integral is
also equal to the limit

$\lim_{N\to\infty}\frac{1}{N}\sum_{n=1}^{N}\tilde h(b_g^n\tilde\Gamma)$

which can be rewritten as

(21)  $\lim_{N\to\infty}\frac{1}{N}\sum_{n=1}^{N} h(a^{r_0n}x) \cdot \ldots \cdot h(a^{\ell r_0n}x).$

We have established the equality of the limits (19) and (21) for a.e. $x \in X_0$. As we mentioned,
a similar argument applies for a.e. $x \in X$ and this readily implies (18).
Next we use (18) to establish (16). In this regard, we estimate the limit

(22)  $\lim_{N\to\infty}\frac{1}{N}\sum_{n=1}^{N}\int h(x) \cdot h(a^{r_0n}x) \cdot \ldots \cdot h(a^{\ell r_0n}x)\ dm_X.$

Since the function $h$ is a.e. non-negative, using a uniform version of Furstenberg's multiple
recurrence theorem (see e.g. [7]) we get that the limit (22) is bounded from below by a positive
constant $c_2$ that depends only on $\int h\, dm_X = \int f\, d\mu$ (and is independent of $r_0$). Combining
this with (17) and (18), we get that

$\liminf_{M\to\infty}\frac{1}{M}\sum_{m=1}^{M}\Big(\frac{1}{N_{r_0,m}}\sum_{n=1}^{N_{r_0,m}}\int f \cdot T^{r_0(c_{r_0,m}+mn)}f \cdot \ldots \cdot T^{\ell r_0(c_{r_0,m}+mn)}f\ d\mu\Big) \ge c_2 - c_1\varepsilon.$

Since the positive constants $c_1, c_2$ depend only on $f$ we can choose $\varepsilon < c_2/c_1$ and verify (16)
for $r = r_0$.

To deal with the general case, we use an ergodic decomposition argument. For a.e. ergodic
component we can use the previous argument to find an $r \in \mathbb{N}$ for which (16) holds. Since
there are only countably many choices for $r$, there exists an $r_0 \in \mathbb{N}$ for which (16) holds for a
set of ergodic components that has positive measure. The result follows. □



5. Polynomial structure for Hardy sequences 

5.1. Result and idea of the proof. In this section we shall show that if the function $a \in \mathcal{H}$
satisfies $x^k \prec a(x) \prec x^{k+1}$ for some non-negative integer $k$, then the range of the sequence
$[a(n)]$ contains some suitably chosen polynomial patterns. We are going to work with these
patterns in the next section in order to prove Theorem A'.

Proposition 5.1. Let $a \in \mathcal{H}$ be eventually positive and satisfy $x^k \prec a(x) \prec x^{k+1}$ for some
non-negative integer $k$.

Then for every $r \in \mathbb{N}$, and every large enough $m \in \mathbb{N}$, there exist polynomials $p_{r,m}(n)$ of
degree at most $k - 1$, and $N_{r,m} \in \mathbb{N}$ with $N_{r,m} \to \infty$ (as $m \to \infty$ and $r$ is fixed), such that

$\{r(mn^k + p_{r,m}(n)) : 1 \le n \le N_{r,m}\} \subset \{[a(n)] : n \in \mathbb{N}\}.$

Remark. For $r = k = 1$ we get the same patterns as in Theorems B and B'. For $k > 1$ the
presence of the factors $r$ is needed since a result analogous to Lemma 4.6 does not hold.

The initial idea is rather simple. Let us illustrate it for $r = k = 1$. In this case $x \prec a(x) \prec x^2$
and we are searching for arithmetic progressions of the form

$\{c_m + mn : 1 \le n \le N_m\}$

within the range of $[a(n)]$ for some $c_m, N_m \in \mathbb{N}$ with $N_m \to \infty$. Using the Taylor expansion of
the function $a(x)$ around an integer $n_m$ (to be specified later) we get

(23)  $a(n_m + n) = a(n_m) + a'(n_m)n + a''(\xi_n)n^2/2$

for some $\xi_n \in [n_m, n_m + n]$. Since $a \in \mathcal{H}$ is eventually positive and $x \prec a(x) \prec x^2$, it is easy
to see that $1 \prec a'(x) \prec x$, $a''(x) > 0$, and $a''(x) \to 0$ (see Lemma 5.2). Since $a' \in \mathcal{H}$, the
estimate for $a'$ easily implies that the range of the sequence $[a'(n)]$ is a cofinite subset of $\mathbb{N}$
(see Lemma 5.2). Therefore, for large $m$ there exists $n_m \in \mathbb{N}$ such that $[a'(n_m)] = m$. Since
$n_m \to \infty$, we have $a''(\xi_n) \to 0$ (also $a''(\xi_n) > 0$). Therefore, if we could choose $n_m$ so that in
addition to $[a'(n_m)] = m$ we have that the fractional parts of the numbers $a(n_m)$ and $a'(n_m)$
converge to 0 as $m \to \infty$, then (23) would give that

$[a(n_m + n)] = [a(n_m)] + [a'(n_m)]n = c_m + mn, \quad c_m = [a(n_m)]$

for every $n \in [1, N_m]$, for some $N_m \in \mathbb{N}$ with $N_m \to \infty$. This is exactly what we wanted.
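The idea just described can be tried out on a concrete function. The sketch below takes $a(x) = x^{3/2}$ as an illustrative choice (it satisfies $x \prec a(x) \prec x^2$, and $[a(n)]$ can be computed exactly as the integer square root of $n^3$). For a given $m$ it scans the integers $n_0$ with $[a'(n_0)] = m$, that is $4m^2 \le 9n_0 < 4(m+1)^2$, and records the longest run $1 \le n \le L$ on which $[a(n_0+n)] = [a(n_0)] + mn$. The function names are ours, and the brute-force search replaces the equidistribution argument of the paper.

```python
from math import isqrt


def floor_a(n):
    """[a(n)] for a(x) = x^(3/2), computed exactly as floor(sqrt(n^3))."""
    return isqrt(n ** 3)


def progression_run(n0, m):
    """Largest L such that [a(n0 + n)] == [a(n0)] + m*n for all 1 <= n <= L."""
    c = floor_a(n0)
    L = 0
    # The loop terminates because a(n0 + n) - m*n tends to infinity with n.
    while floor_a(n0 + L + 1) == c + m * (L + 1):
        L += 1
    return L


def best_start(m):
    """Among n0 with [a'(n0)] = m, return (L, n0) with the longest run L."""
    best = (0, None)
    n0 = -(-4 * m * m // 9)              # ceil(4 m^2 / 9)
    while 9 * n0 < 4 * (m + 1) ** 2:     # i.e. 1.5 * sqrt(n0) < m + 1
        L = progression_run(n0, m)
        if L > best[0]:
            best = (L, n0)
        n0 += 1
    return best
```

For $m = 3$ the candidates are $n_0 = 4, 5, 6, 7$, and $n_0 = 4$ gives the progression $8, 11, 14$ inside the range of $[n^{3/2}]$, so `best_start(3)` returns `(2, 4)`. Calling `best_start` for larger values of `m` shows how the run lengths behave for this particular function.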

To carry out this plan we shall need an equidistribution result that will enable us to get the 
"small fractional parts" assumption that we mentioned before. We are going to prove this by 
using a classical estimate of van der Corput on oscillatory exponential sums. 

5.2. Some basic properties. As we explained before, if a G TL and b G ££, then the limit 
lim x - >oc a'(x)/b'(x) exists (possibly infinite). Hence, if both functions a(x),b(x) converge to 0, 
or to oo, then by L'Hospital's rule we have \\m x ^, 00 a{x) /b{x) = lim x ^ 00 a'(x)/b'(x). We are 
going to make use of this fact to prove some basic properties of elements of TL that we shall 
frequently use: 

Lemma 5.2. Let a G TL be eventually positive and satisfy x k -< a{x) -< x k+1 for some non- 
negative integer k. Then 

(i) x k ~ l -< a( l \x) -< x k+1 ~ l for every I < k, x~ 1 ~ s -< a^ k+1 \x) -< 1 for every 5 > o]l and a^ l \x) 
is eventually positive for every I < k + 1. 

(ii) a^(x) is eventually increasing for < I < k, and decreasing for I = k + 1. 

(Hi) If k ^ for every non-zero c G K we have x k ~ l -< a(x + c) — a(x) -< x k . If k = 0, then 
a[x + c) — a(x) -< 1 and the range of [a(n]] is a cofinite subset o/N. 

(iv) For every c G M we have a(x + c)/a(x) 1 as x — > oo. More generally the same holds if 
c = c(x) is a bounded function. 



'Take a(x) — log a; or a(x) = log log a; and k = 0, / = 1, to see the necessity of introducing the term 5. 
Proof. (i) All parts are a direct consequence of L'Hospital's rule.

(ii) If $l \le k$ the result follows immediately from part (i). If $l = k+1$, by part (i) we have that $a^{(k+1)}(x) \to 0$ and $a^{(k+1)}(x)$ is eventually positive. Since every element of $\mathcal{H}$ is eventually monotone, it follows that $a^{(k+1)}(x)$ is eventually decreasing.

(iii) Let $k \neq 0$. By part (i) we have that $x^{k-1} \prec a'(x) \prec x^k$. The result now follows from the mean value theorem. Suppose now that $k = 0$. By part (i) we have $a'(x) \prec 1$ and the mean value theorem gives that $a(x+c) - a(x) \prec 1$. Since $a(n) \to +\infty$ and $a(n+1) - a(n) \to 0$ it follows that the range of the sequence $[a(n)]$ is a cofinite subset of $\mathbb{N}$.

(iv) Notice that $a(x+c) = a(x) + b(x)$ where $b(x) = a(x+c) - a(x) \prec a(x)$ by part (iii). It follows that $a(x+c)/a(x) \to 1$. Suppose now that $c = c(x)$ satisfies $|c(x)| \le M$. Since $a(x)$ is eventually positive and increasing, we have $a(x-M)/a(x) \le a(x+c(x))/a(x) \le a(x+M)/a(x)$ for all large enough $x$. The result now follows from the case where $c(x)$ is constant. □
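As a numerical sanity check of parts (iii) and (iv) of Lemma 5.2 (an illustration only, not part of the proof), the sketch below uses the sample function $a(x) = x^{3/2}$, so that $k = 1$. The choice of function, the shift $c = 3$ and the helper names are assumptions made for this example.

```python
def a(x: float) -> float:
    """Sample function with x^1 < a(x) < x^2 in growth, so k = 1."""
    return x ** 1.5


def ratio(x: float, c: float) -> float:
    """a(x + c) / a(x); part (iv) says this tends to 1."""
    return a(x + c) / a(x)


def difference(x: float, c: float) -> float:
    """a(x + c) - a(x); part (iii) places it between x^(k-1) and x^k."""
    return a(x + c) - a(x)


for x in (1e2, 1e4, 1e6):
    # The second column approaches 1; the third stays near 1.5 * c = 4.5,
    # i.e. the difference grows like x^(1/2), strictly between x^0 and x^1.
    print(x, ratio(x, 3.0), difference(x, 3.0) / x ** 0.5)
```

For this particular $a$ the difference $a(x+c) - a(x)$ behaves like $\tfrac{3}{2} c\, x^{1/2}$, which is consistent with the mean value theorem argument used in the proof.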

Given some growth estimates for $a \in \mathcal{H}$ we shall derive some estimates about the compositional inverse $a^{-1}$ of $a$ (which is not necessarily in $\mathcal{H}$).

Lemma 5.3. Let $a \in \mathcal{H}$ be eventually positive and satisfy $x^\delta \prec a(x) \prec x$ for some $\delta \in (0,1)$. Then

(i) $x \prec a^{-1}(x) \prec x^{1/\delta}$.

(ii) $(a^{-1})'$ is eventually increasing, $1 \prec (a^{-1})'(x) \prec x^{1/\delta - 1}$, and $(a^{-1})''(x) \prec 1$.

(iii) For every $c \in \mathbb{R}$ we have $(a^{-1})'(x+c)/(a^{-1})'(x) \to 1$ as $x \to \infty$.

Proof. (i) By Lemma 5.2 we have that $a$ is eventually increasing, so the same is true for $a^{-1}$. Hence, our hypothesis gives $a^{-1}(x^\delta) \prec x \prec a^{-1}(x)$, which implies the advertised estimate.

(ii) By Lemma 5.2 we have that $a'$ is eventually decreasing. Since

(24) $$(a^{-1})'(x) = \frac{1}{a'(a^{-1}(x))}$$

and $a^{-1}$ is eventually increasing, it follows immediately that $(a^{-1})'$ is eventually increasing. Using L'Hospital's rule we get

(25) $$x^{-1+\delta} \prec a'(x) \prec 1.$$

Combining (24), (25), and part (i), we get the first estimate, and similarly we deal with the second.

(iii) Using the estimates in (ii), the proof is the same as in parts (iii) and (iv) of Lemma 5.2. □

5.3. An equidistribution result. In this subsection we shall establish the equidistribution result needed for the proof of Proposition 5.1. We first state and prove it in its simplest form (Lemma 5.6), and subsequently we prove a more technical variation (Proposition 5.7) that is better suited for our purposes.

The following estimate is crucial for the results in this subsection. The proof can be found 
in [2U (Theorem 2.7). 

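Lemma 5.4 (van der Corput [47]). Let $k, l$ be integers with $k < l$ and let $f$ be twice differentiable on $[k, l]$ with $f''(x) \ge \rho > 0$ or $f''(x) \le -\rho < 0$ for $x \in [k, l]$.
Then

$$\Big|\sum_{n=k}^{l} e(f(n))\Big| \le \big(|f'(l) - f'(k)| + 2\big)\Big(\frac{4}{\sqrt{\rho}} + 3\Big).$$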
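The estimate of Lemma 5.4 can be checked numerically on a concrete phase. The sketch below assumes $e(t) = e^{2\pi i t}$ and uses the sample function $f(x) = x^{3/2}$ on $[1000, 2000]$; the function, the interval and all helper names are illustrative choices, not taken from the text.

```python
import cmath
import math


def e(t: float) -> complex:
    """The character e(t) = exp(2*pi*i*t)."""
    return cmath.exp(2j * math.pi * t)


def exp_sum(f, k: int, l: int) -> float:
    """Absolute value of sum_{n=k}^{l} e(f(n))."""
    return abs(sum(e(f(n)) for n in range(k, l + 1)))


def vdc_bound(df, rho: float, k: int, l: int) -> float:
    """Right-hand side of Lemma 5.4: (|f'(l) - f'(k)| + 2)(4/sqrt(rho) + 3)."""
    return (abs(df(l) - df(k)) + 2) * (4 / math.sqrt(rho) + 3)


f = lambda x: x ** 1.5            # sample phase
df = lambda x: 1.5 * x ** 0.5     # its first derivative
d2f = lambda x: 0.75 * x ** -0.5  # its second derivative (positive, decreasing)

k, l = 1000, 2000
rho = d2f(l)  # f'' is decreasing, so its minimum on [k, l] is attained at l

print("sum   :", exp_sum(f, k, l))
print("bound :", vdc_bound(df, rho, k, l))
print("terms :", l - k + 1)
```

On this interval the bound is smaller than the number of terms, so it already improves on the trivial estimate.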
Definition 5.5. Let $(I_m)_{m\in\mathbb{N}}$ be a sequence of intervals of integers (with lengths going to $\infty$). We say that the sequence $a(n)$ with values in $\mathbb{R}^d$ is equidistributed in $\mathbb{T}^d$ with respect to the intervals $I_m$, if for every Riemann integrable function $\phi\colon \mathbb{T}^d \to \mathbb{C}$ we have

$$\lim_{m\to\infty} \frac{1}{|I_m|} \sum_{n\in I_m} \phi(a(n)) = \int_{\mathbb{T}^d} \phi \; dm_{\mathbb{T}^d}.$$

As is well known, it suffices to verify the previous identity for every non-trivial character $\phi = \chi$ of $\mathbb{T}^d$ (in which case the integral is $0$).

Lemma 5.6. Suppose that $a \in \mathcal{H}$ is eventually positive and satisfies $x \prec a(x) \prec x^2$.
Then for every $\varepsilon > 0$ there exists a sequence of intervals $(I_m)_{m\in\mathbb{N}}$ such that

(i) $m \le a'(n) \le m + \varepsilon$, for every $n \in I_m$ and $m$ large enough, and

(ii) the sequence $a(n)$ is equidistributed in $\mathbb{T}$ with respect to the intervals $I_m$.

Proof. Let $\varepsilon > 0$. Suppose for the moment that $k_m, l_m \in \mathbb{N}$ have been chosen so that the intervals $I_m = [k_m, l_m]$ satisfy condition (i). Let us see how we deal with condition (ii). We need to guarantee that for every non-zero integer $s$ we have

(26) $$\lim_{m\to\infty} \frac{1}{l_m - k_m} \sum_{n=k_m}^{l_m} e(s\,a(n)) = 0.$$

To estimate the average in (26) we are going to use Lemma 5.4. Since $m \le a'(n) \le m + \varepsilon$ for $n \in I_m$, we have $|a'(l_m) - a'(k_m)| \le \varepsilon$, and since $a''(x)$ is eventually decreasing (by Lemma 5.2) we have for large $m$ that $\rho = a''(l_m)$ satisfies the assumptions of Lemma 5.4. We get

$$\Big|\frac{1}{l_m - k_m} \sum_{n=k_m}^{l_m} e(s\,a(n))\Big| \le \frac{2 + \varepsilon s}{l_m - k_m} \cdot \Big(\frac{4}{\sqrt{s\,a''(l_m)}} + 3\Big).$$

Therefore, in order to establish (26) it suffices to show that

(27) $$\lim_{m\to\infty} \frac{1}{(l_m - k_m)\sqrt{a''(l_m)}} = 0.$$

We shall now make a choice of $k_m, l_m \in \mathbb{N}$ so that conditions (i) and (27) are satisfied. Notice that Lemma 5.2 implies that $a'(x)$ increases to $+\infty$. We consider two cases:

Case 1. Suppose that $a(x) \prec x^{1+\delta}$ for every $\delta > 0$. Let $k_m$ be the first integer such that $a'(n) \ge m$, and define

$$I_m = [k_m, k_m + k_m^{3/4}].$$

We first show that condition (i) is satisfied. Choose any $\delta \in (0, 1/4)$. Since $a(n) \prec n^{1+\delta}$, arguing as in Lemma 5.2 we get $a''(n) \prec n^{-1+\delta}$. Using the mean value theorem and the fact that $a'(x)$ is eventually increasing (by Lemma 5.2) we get

(28) $$\max_{0 \le n \le k_m^{3/4}} \big(a'(k_m + n) - a'(k_m)\big) = a'(k_m + k_m^{3/4}) - a'(k_m) = k_m^{3/4} \cdot a''(\xi_m) \prec k_m^{3/4} \cdot k_m^{-1+\delta} \to 0.$$

Furthermore, since $a'(n+1) - a'(n) \to 0$ (by Lemma 5.2), from the definition of $k_m$ we have that $a'(k_m) - m \to 0$ as $m \to \infty$. From this and (28) it follows that condition (i) is satisfied.
It remains to verify (27). Since $a(n) \succ n$ we get by Lemma 5.2 that $a''(n) \succ n^{-1-\delta}$ for every $\delta > 0$. Using this, and keeping in mind that $\delta < 1/4$, we find that

$$\frac{1}{(l_m - k_m)\sqrt{a''(l_m)}} \prec \frac{(k_m + k_m^{3/4})^{\frac{1+\delta}{2}}}{k_m^{3/4}} \prec k_m^{-1/8} \to 0.$$

This proves (27) and completes the proof of Case 1.

Case 2. Suppose that $x^{1+\delta} \prec a(x) \prec x^2$ for some $\delta > 0$. Using Lemma 5.2 we find that $x^\delta \prec a'(x) \prec x$, so we can apply Lemma 5.3 for $a'$ in place of $a$. For convenience we set

$$b(x) = (a')^{-1}(x)$$

and summarize some properties that follow from Lemmas 5.2 and 5.3 and will be used later:

(29) $a'(n+1) - a'(n) \to 0$, $b'(n)$ increases to $\infty$, $b'(n+c)/b'(n) \to 1$ for every $c \in \mathbb{R}$.

We define

$$I_m = \{n \in \mathbb{N}\colon m \le a'(n) \le m + \varepsilon\} = [k_m, l_m].$$

Obviously, condition (i) is satisfied, therefore it remains to verify (27). Since $a'(n+1) - a'(n) \to 0$ (by (29)) we get from the definition of $k_m$ that $a'(k_m) - m \to 0$. It follows that $b(m) - k_m \to 0$, and similarly we get $b(m+\varepsilon) - l_m \to 0$. Since $b'$ is eventually increasing (by (29)), using the mean value theorem we get for large $m$ that

(30) $$l_m - k_m \ge \frac{1}{2} \cdot \big(b(m+\varepsilon) - b(m)\big) = \frac{\varepsilon}{2} \cdot b'(\xi_m) \ge \frac{\varepsilon}{2} \cdot b'(m).$$

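Furthermore, since $b'(x) = \frac{1}{a''(b(x))}$, or equivalently $a''(b(x)) = \frac{1}{b'(x)}$, setting $x = m + \varepsilon$ and using that $b(m+\varepsilon) - l_m \to 0$ gives

$$a''(l_m) - \frac{1}{b'(m+\varepsilon)} \to 0.$$

It follows that for large $m$ we have

(31) $$a''(l_m) \ge \frac{1}{2\,b'(m+\varepsilon)}.$$

Combining (30) and (31) we get for large $m$ that

$$\frac{1}{(l_m - k_m)\sqrt{a''(l_m)}} \le C_2\, \frac{\sqrt{b'(m+\varepsilon)}}{b'(m)} = C_2\, \sqrt{\frac{b'(m+\varepsilon)}{b'(m)}} \cdot \frac{1}{\sqrt{b'(m)}}$$

where $C_2 = 2^{3/2}/\varepsilon$. The last expression converges to zero as $m \to \infty$ since the first fraction converges to $1$ and $b'(m) \to \infty$ (by (29)). Hence, we have established (27), completing the proof of Case 2.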

Since the two cases cover all the functions $a \in \mathcal{H}$ that satisfy $x \prec a(x) \prec x^2$ the proof is complete. □
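To make the Case 2 choice of intervals concrete, here is a small numerical sketch for the sample function $a(x) = x^{3/2}$ (so $a'(x) = \tfrac{3}{2}\sqrt{x}$). It builds $I_m = \{n\colon m \le a'(n) \le m + \varepsilon\}$ and evaluates the normalized exponential sum from (26) with $s = 1$. The sample function, the value $\varepsilon = 0.5$ and the helper names are assumptions made for illustration; the printed values only suggest the decay and prove nothing.

```python
import cmath
import math


def a(x: float) -> float:
    """Sample function with x^(1+delta) < a(x) < x^2 in growth (Case 2)."""
    return x ** 1.5


def a_prime(x: float) -> float:
    return 1.5 * math.sqrt(x)


def interval(m: int, eps: float) -> tuple[int, int]:
    """Endpoints (k_m, l_m) of I_m = {n : m <= a'(n) <= m + eps}."""
    k_m = math.ceil((m / 1.5) ** 2)
    l_m = math.floor(((m + eps) / 1.5) ** 2)
    # Guard against floating point rounding at the endpoints.
    while a_prime(k_m) < m:
        k_m += 1
    while a_prime(k_m - 1) >= m:
        k_m -= 1
    while a_prime(l_m) > m + eps:
        l_m -= 1
    return k_m, l_m


def average(s: int, m: int, eps: float) -> float:
    """|(1/|I_m|) * sum_{n in I_m} e(s * a(n))| with e(t) = exp(2*pi*i*t)."""
    k_m, l_m = interval(m, eps)
    total = sum(cmath.exp(2j * math.pi * s * a(n)) for n in range(k_m, l_m + 1))
    return abs(total) / (l_m - k_m + 1)


for m in (50, 200, 800):
    k_m, l_m = interval(m, 0.5)
    print(m, (k_m, l_m), l_m - k_m + 1, average(1, m, 0.5))
```

The interval lengths grow roughly linearly in $m$ here, in line with the lower bound (30), since $b(x) = (a')^{-1}(x) = (2x/3)^2$ has $b'(m)$ proportional to $m$.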

We now derive an extension of Lemma 5.6 that will be used later. A big part of the proof is analogous to that of Lemma 5.6 so we are just going to sketch it.

Proposition 5.7. Suppose that $a \in \mathcal{H}$ is eventually positive and satisfies $x^k \prec a(x) \prec x^{k+1}$ for some $k \in \mathbb{N}$. Let $\varepsilon > 0$ and $d_0, d_1, \ldots, d_k \in \mathbb{N}$.

Then there exists a sequence of intervals $(I_m)_{m\in\mathbb{N}}$ such that

(i) $d_k m \le a^{(k)}(n) \le d_k m + \varepsilon$, for every $n \in I_m$ and $m$ large enough, and

(ii) the sequence $\big(a(n)/d_0, a'(n)/d_1, \ldots, a^{(k-1)}(n)/d_{k-1}\big)$ is equidistributed in $\mathbb{T}^k$ with respect to the sequence of intervals $I_m$.

Proof. We assume that $d_0 = d_1 = \ldots = d_k = 1$; the general case is similar. As in the proof of Lemma 5.6 we define the sequence of intervals $I_m = [k_m, l_m]$ as follows:

Case 1: If $a(x) \prec x^{k+\delta}$ for every $\delta > 0$ (in which case $a^{(k)}(x) \prec x^\delta$ for every $\delta > 0$), then $I_m = [k_m, k_m + k_m^{3/4}]$ where $k_m$ is the smallest $n \in \mathbb{N}$ such that $a^{(k)}(n) \ge m$.

Case 2: If $a(x) \succ x^{k+\delta}$ for some $\delta > 0$ (in which case $x^\delta \prec a^{(k)}(x) \prec x$ for some $\delta > 0$), then $I_m = \{n\colon m \le a^{(k)}(n) \le m + \varepsilon\}$.

Arguing as in Lemma 5.6 we can show that condition (i) is satisfied. Therefore, it remains to show the equidistribution property (ii), or equivalently, that for $c_0, \ldots, c_{k-1} \in \mathbb{Z}$, not all of them zero, the sequence $b(n)$ defined by

$$b(n) = c_0 a(n) + c_1 a'(n) + \ldots + c_{k-1} a^{(k-1)}(n)$$

is equidistributed in $\mathbb{T}$ with respect to the sequence of intervals $I_m$. To do this we shall use a difference theorem of van der Corput (Theorem 3.1 in [34]) which enables us to reduce matters to a setup similar to the one treated in Lemma 5.6 (a similar trick was used in [16]). We are going to assume that $c_0 \neq 0$; the other cases can be treated similarly.

By the theorem of van der Corput, in order to show that $b(n)$ is equidistributed in $\mathbb{T}$ with respect to the sequence of intervals $I_m$, it suffices to show that for every $m \in \mathbb{N}$ the sequence $\Delta_m b(n)$, where $\Delta_m b(n) = b(n+m) - b(n)$, is equidistributed in $\mathbb{T}$ with respect to the sequence of intervals $I_m$. Applying this successively we reduce our problem to showing that for every $m_1, \ldots, m_{k-1} \in \mathbb{N}$ the sequence $B(n)$, where the function $B \in \mathcal{H}$ is defined by

$$B(x) = \Delta_{m_1} \Delta_{m_2} \cdots \Delta_{m_{k-1}} b(x),$$

is equidistributed in $\mathbb{T}$ with respect to the sequence of intervals $I_m$.

In order to prove this, we first derive some properties about the function $B(x)$ that will be useful. Since $x^k \prec b(x) \prec x^{k+1}$, by repeatedly applying Lemma 5.2 we get

(32) $$x \prec B(x) \prec x^2.$$

It will also be useful to relate the functions $B'(x)$ and $a^{(k)}(x)$. By repeatedly applying the mean value theorem and using that $a^{(k+1)}(x) \to 0$ (follows from Lemma 5.2) we get

(33) $$\lim_{x\to\infty} \big(B'(x) - M a^{(k)}(x + \xi_x)\big) = 0$$

where $M = c_0 m_1 \cdots m_{k-1}$, for some $\xi_x \in \mathbb{R}$ that satisfies $0 \le \xi_x \le m_1 + \ldots + m_{k-1}$. Moreover, since $a^{(k)}(x) \prec x$ and $\xi_x$ is bounded, we get by Lemma 5.2 that

$$a^{(k)}(x + \xi_x) - a^{(k)}(x) \to 0.$$

Combining this with (33) we get

(34) $$\lim_{x\to\infty} \big(B'(x) - M a^{(k)}(x)\big) = 0.$$

Using (32) and arguing as in the proof of Lemma 5.6 we get that the sequence $B(n)$ is equidistributed in $\mathbb{T}$ with respect to the sequence of intervals $J_m$ that are chosen as follows:

Case A. If $B(x) \prec x^{1+\delta}$ for every $\delta > 0$, then we can set $J_m = I_m$ (because of (34) it is easy to verify that this choice works). Therefore, in this case we are immediately done.

Case B. If $B(x) \succ x^{1+\delta}$ for some $\delta > 0$, we can choose

$$J_m = \{n\colon rm + e(n) \le B'(n) \le r \cdot (m + \varepsilon) + e(n)\}$$

where $r$ is any positive real number and $e(n)$ is any sequence that converges to $0$. Our objective is to choose $r$ and $e(n)$ so that $J_m = I_m$. We choose $r = M$ and $e(n) = B'(n) - M a^{(k)}(n)$ (which converges to $0$ by (34)). In this case we have that

$$J_m = \{n\colon Mm \le M a^{(k)}(n) \le M(m + \varepsilon)\} = I_m.$$

Therefore, in both cases we get the required equidistribution property. This completes the proof. □

5.4. Finding the polynomial patterns. We will now complete the proof of Proposition 5.1. First we use Proposition 5.7 to derive some more useful results for our particular setup.

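Lemma 5.8. Suppose that $a \in \mathcal{H}$ is eventually positive and satisfies $x^k \prec a(x) \prec x^{k+1}$ for some $k \in \mathbb{N}$. Let $r, m \in \mathbb{N}$ and $\varepsilon > 0$.

Then there exist $n_{r,m} \in \mathbb{N}$ with $n_{r,m} \to \infty$ (as $m \to \infty$ and $r$ is fixed), such that for all large $m$ we have

$$\Big[\frac{a^{(k)}(n_{r,m})}{k!}\Big] = rm, \qquad \Big[\frac{a^{(i)}(n_{r,m})}{i!}\Big] \equiv 0 \pmod r, \quad i = 0, \ldots, k-1, \qquad \Big\{\frac{a^{(i)}(n_{r,m})}{i!}\Big\} \le \varepsilon, \quad i = 0, \ldots, k.$$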



Proof. Let $0 < \varepsilon < 1$ and $r \in \mathbb{N}$ be fixed. By Proposition 5.7 the sequence

$$b(n) = \Big(\frac{a(n)}{r}, \frac{a'(n)}{r \cdot 1!}, \ldots, \frac{a^{(k-1)}(n)}{r \cdot (k-1)!}\Big)$$

is equidistributed in $\mathbb{T}^k$ with respect to the intervals

$$I_m = \{n \in \mathbb{N}\colon rmk! \le a^{(k)}(n) \le rmk! + \varepsilon\}.$$

Hence, for large enough $m$ there exists an $n_{r,m} \in I_m$ such that $b(n_{r,m}) \in [0, \varepsilon/r]^k$. The result follows by noticing that $\{x/r\} < 1/r$ implies that $[x] \equiv 0 \pmod r$, and the estimate $\{x\} \le r\{x/r\}$. □
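The two elementary facts about integer and fractional parts used at the end of this proof can be verified exactly on rational inputs. The following sketch is only a check on a finite grid of test values chosen for illustration; the helper names are ours.

```python
import math
from fractions import Fraction


def frac(x: Fraction) -> Fraction:
    """Fractional part {x} = x - [x], computed exactly."""
    return x - math.floor(x)


def facts_hold(x: Fraction, r: int) -> bool:
    """Check: {x/r} < 1/r implies [x] = 0 (mod r), and {x} <= r * {x/r}."""
    implication = (frac(x / r) >= Fraction(1, r)) or (math.floor(x) % r == 0)
    estimate = frac(x) <= r * frac(x / r)
    return implication and estimate


# Example: x = 12.01 and r = 4 give {x/r} = 0.0025 < 1/4, and [x] = 12 is a multiple of 4.
print(facts_hold(Fraction(1201, 100), 4))
print(all(facts_hold(Fraction(num, 37), r) for num in range(2000) for r in range(1, 7)))
```

Both facts follow from writing $x = r[x/r] + r\{x/r\}$: when $r\{x/r\} < 1$ the first summand is $[x]$ and the second is $\{x\}$.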

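Lemma 5.9. Suppose that $a \in \mathcal{H}$ is eventually positive and satisfies $x^k \prec a(x) \prec x^{k+1}$ for some $k \in \mathbb{N}$.

Then there exist $\varepsilon_{r,m} \in \mathbb{R}$, $n_{r,m} \in \mathbb{N}$, with $\varepsilon_{r,m} \to 0$, $n_{r,m} \to \infty$ (as $m \to \infty$ and $r$ is fixed), such that for all large $m$ we have

$$\Big[\frac{a^{(k)}(n_{r,m})}{k!}\Big] = rm, \qquad \Big[\frac{a^{(i)}(n_{r,m})}{i!}\Big] \equiv 0 \pmod r, \quad i = 0, \ldots, k-1, \qquad \Big\{\frac{a^{(i)}(n_{r,m})}{i!}\Big\} \le \varepsilon_{r,m}, \quad i = 0, \ldots, k.$$

Proof. Let $r \in \mathbb{N}$. For $\varepsilon = 1/j$, $j \in \mathbb{N}$, there exist $M_{r,j}$ and $n_{r,m}(j)$ such that the conclusion of Lemma 5.8 is satisfied for every $m \ge M_{r,j}$. For $M_{r,j} \le m < M_{r,j+1}$, let $\varepsilon_{r,m} = 1/j$ and $n_{r,m} = n_{r,m}(j)$. Thus defined, the sequences $\varepsilon_{r,m}, n_{r,m}$ satisfy the conclusions of our lemma for every $m \ge M_{r,1}$. □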



We are now ready to prove Proposition 5.1.

Proof of Proposition 5.1. If $k = 0$ the result follows immediately from Lemma 5.2. Suppose that $k \ge 1$ and let $n_{r,m}$ be as in the statement of Lemma 5.9. Using the Taylor expansion of $a(x)$ around the point $x = n_{r,m}$ we get for $n \in \mathbb{N}$ that

(35) $$a(n_{r,m} + n) = a(n_{r,m}) + n a'(n_{r,m}) + \ldots + \frac{n^k}{k!} a^{(k)}(n_{r,m}) + \frac{n^{k+1}}{(k+1)!} a^{(k+1)}(\xi_{r,n})$$

for some $\xi_{r,n} \in [n_{r,m}, n_{r,m} + n]$. By Lemma 5.2 we have $a^{(k+1)}(x) \to 0$ as $x \to \infty$ (also $a^{(k+1)}(x) > 0$). Furthermore, we have that $\big\{\frac{a^{(i)}(n_{r,m})}{i!}\big\} \le \varepsilon_{r,m}$ for $i = 0, \ldots, k$, where $\varepsilon_{r,m} \to 0$ as $m \to \infty$. It follows that there exist integers $N_{r,m}$ such that $N_{r,m} \to \infty$ and for every $1 \le n \le N_{r,m}$ and all large $m \in \mathbb{N}$ we have

$$\{a(n_{r,m})\} + n\{a'(n_{r,m})\} + \ldots + n^k \Big\{\frac{a^{(k)}(n_{r,m})}{k!}\Big\} + n^{k+1} \frac{a^{(k+1)}(\xi_{r,n})}{(k+1)!} \le \frac{1}{2}.$$

For these values of $m$ and $n$, equation (35) gives

$$[a(n_{r,m} + n)] = [a(n_{r,m})] + n[a'(n_{r,m})] + \ldots + n^k \Big[\frac{a^{(k)}(n_{r,m})}{k!}\Big].$$

Remembering that $n_{r,m}$ was chosen to also satisfy $\big[\frac{a^{(k)}(n_{r,m})}{k!}\big] = rm$ and $\big[\frac{a^{(i)}(n_{r,m})}{i!}\big] \equiv 0 \pmod r$, $i = 0, \ldots, k-1$, we get for $1 \le n \le N_{r,m}$ that

$$[a(n_{r,m} + n)] = r\big(c_{0,r,m} + c_{1,r,m} n + \ldots + c_{k-1,r,m} n^{k-1} + m n^k\big)$$

for some $c_{i,r,m} \in \mathbb{N}$, $i = 0, \ldots, k-1$. This proves the advertised result. □



6. Multiple recurrence for Hardy field sequences 

In this section we shall prove our main result which we now recall: 

Theorem 6.1. Let $a \in \mathcal{H}$ satisfy $x^k \prec a(x) \prec x^{k+1}$ for some non-negative integer $k$.
Then S = {[a(l)], [a(2)], . . .} is a set of multiple recurrence. 

Using Proposition 5.1 we see that for elements of $\mathcal{H}$ that are eventually positive Theorem A' is an immediate consequence of the following result (the case of eventually negative elements of $\mathcal{H}$ can be treated similarly):

Theorem 6.2. Suppose that for every $r, m \in \mathbb{N}$ the set $S \subset \mathbb{N}$ contains patterns of the form

$$\{r(mn^k + p_{r,m}(n)),\ 1 \le n \le N_{r,m}\}$$

where $p_{r,m}(n)$ is an integer polynomial of degree at most $k - 1$, and $N_{r,m} \in \mathbb{N}$ satisfy $N_{r,m} \to \infty$ (as $m \to \infty$ and $r$ is fixed).

Then $S$ is a set of multiple recurrence.

The rest of this section is devoted to the proof of Theorem 6.2.

6.1. Proof of Theorem 6.2 modulo a technical result. Our argument is similar to the
one used to prove Theorem B' and is carried out in two steps. We first show that it suffices to 
verify a certain multiple recurrence property for nilsystems, and we then verify this property 
using an equidistribution result on nilmanifolds. 

The reduction to nilsystems step is a direct consequence of Theorem 2.3 and the following
result which serves as a substitute for Proposition [ 
Proposition 6.3. Let $(X, \mathcal{B}, \mu, T)$ be a system and $f_0, f_1, \ldots, f_\ell \in L^\infty(\mu)$ be such that $f_i \perp \mathcal{Z}$ for some $i = 0, 1, \ldots, \ell$, where $\mathcal{Z}$ is the nilfactor of the system. For fixed $k, r \in \mathbb{N}$ consider the polynomials $p_m(n) = r(mn^k + q_m(n))$ where $\deg(q_m) \le k - 1$, $m \in \mathbb{N}$, and let $(N_m)_{m\in\mathbb{N}}$ be a sequence of positive integers with $N_m \to \infty$.
Then

$$\lim_{M\to\infty} \frac{1}{M} \sum_{m=1}^{M} \Big| \frac{1}{N_m} \sum_{n=1}^{N_m} \int f_0 \cdot T^{p_m(n)} f_1 \cdot T^{2p_m(n)} f_2 \cdot \ldots \cdot T^{\ell p_m(n)} f_\ell \; d\mu \Big| = 0.$$

The verification of the multiple recurrence property for nilsystems is based on the following 
equidistribution result which serves as a substitute for Proposition 4.5.

Proposition 6.4. Let $X = G/\Gamma$ be a connected nilmanifold and let $a \in G$ be an ergodic nilrotation. Suppose that $(q_m)_{m\in\mathbb{N}}$ is a sequence of integer polynomials with degree at most $k - 1$, and let $(N_m)_{m\in\mathbb{N}}$ be a sequence of positive integers with $N_m \to \infty$.
Then for every $F \in C(X)$ we have

$$\lim_{M\to\infty} \frac{1}{M} \sum_{m=1}^{M} \Big( \frac{1}{N_m} \sum_{n=1}^{N_m} F\big(a^{mn^k + q_m(n)} \Gamma\big) \Big) = \int F \; dm_X.$$

Proof. The argument is identical to the one used to prove Proposition 4.5. □

Proof of Theorem 6.2. Using Proposition 6.3 and Proposition 6.4 the argument is identical to the one used to finish the proof of Theorem B' (Section 4.2). □

6.2. Proof of Proposition 6.3. The proof of Proposition 6.3 is carried out in two steps. First
we establish an estimate about general polynomial families using an inductive argument that is 
frequently used when one deals with multiple ergodic averages along polynomial iterates. We 
then apply this estimate to show that the ergodic averages we are interested in are majorized by 
some multi-parameter polynomial ergodic averages. A known result shows that these averages 
are controlled by nilsystems, so the same should be the case for our averages. 

6.2.1. A PET induction argument. We start with some notational conventions that we use henceforth: If $h = (h_1, \ldots, h_r)$, $H = (H_1, \ldots, H_r)$, when we write $1 \le h \le H$ we mean $1 \le h_i \le H_i$ for $i = 1, \ldots, r$, and when we write $|H|$ we mean $H_1 \cdot \ldots \cdot H_r$. With $o_h(1)$ we denote an expression that converges to zero when $h_1, \ldots, h_r \to \infty$, and for $N \in \mathbb{N}$ we denote by $o_{N,h,h\prec N}(1)$ a quantity that goes to $0$ if $h_i, N \to \infty$ and $h_i/N \to 0$.

We briefly review some notions from [32] (most of which were introduced in [3]). We say that a property holds for almost every $h \in \mathbb{Z}^r$ if it holds outside of a subset of $\mathbb{Z}^r$ with upper density zero. We remark that the set of zeros of any non-identically zero polynomial $p\colon \mathbb{Z}^r \to \mathbb{Z}$ has zero upper density. If $a_0, a_1, \ldots, a_k\colon \mathbb{Z}^r \to \mathbb{Z}$ are integer polynomials, with $a_k$ not identically zero, we call a function $p\colon \mathbb{Z}^{r+1} \to \mathbb{Z}$ defined by $p(h,n) = a_k(h)n^k + \cdots + a_1(h)n + a_0(h)$ an integer polynomial with $r$ parameters and degree $k$. If $p(h,n)$ has degree $k$, then for almost every $h \in \mathbb{Z}^r$, the degree of the polynomial $p(h,n)$ with respect to the variable $n$ is $k$. A set $\mathcal{P} = \{p_1(h,n), \ldots, p_k(h,n)\}$, where $p_i(h,n)$ are integer polynomials with $r$ parameters, is called a family of integer polynomials with $r$ parameters. The polynomials in $\mathcal{P}$ are non-constant if they all have positive degree, and essentially distinct if all their pairwise differences have positive degree. The maximum degree of the polynomials is called the degree of the polynomial family and is denoted by $\deg(\mathcal{P})$. Given a polynomial family $\mathcal{P}$ with several parameters, let $\mathcal{P}_i$ be the subfamily of polynomials of degree $i$ in $\mathcal{P}$. We let $w_i$ denote the number of distinct leading coefficients that appear in the family $\mathcal{P}_i$. The vector $(d, w_d, \ldots, w_1)$ is called the type of the polynomial family $\mathcal{P}$.

We shall use an induction scheme, often called PET induction (Polynomial Exhaustion Technique), on types of polynomial families that was introduced by Bergelson in [3]. To do this we order the set of all possible types lexicographically; this means that $(d, w_d, \ldots, w_1) > (d', w'_{d'}, \ldots, w'_1)$ if and only if in the first instance where the two vectors disagree the coordinate of the first vector is greater than the coordinate of the second vector.
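The type of a family and the lexicographic order on types are easy to compute in the parameter-free case. The sketch below represents a polynomial in $n$ by its list of integer coefficients `[a_0, a_1, ..., a_d]`; this representation and the function names are assumptions made for illustration, and families with parameters $h$ are not handled.

```python
def degree(p: list[int]) -> int:
    """Degree of p = [a_0, a_1, ..., a_d] (trailing zeros are ignored)."""
    d = len(p) - 1
    while d > 0 and p[d] == 0:
        d -= 1
    return d


def family_type(family: list[list[int]]) -> tuple[int, ...]:
    """Type (d, w_d, ..., w_1): w_i counts distinct leading coefficients in degree i."""
    d = max(degree(p) for p in family)
    weights = []
    for i in range(d, 0, -1):
        leading = {p[i] for p in family if degree(p) == i}
        weights.append(len(leading))
    return (d, *weights)


# The family {n, 2n, n^2} of Example 7.
family = [[0, 1], [0, 2], [0, 0, 1]]
print(family_type(family))

# Python compares tuples lexicographically, which matches the order on types.
print((2, 1, 2) > (2, 1, 1) > (2, 1, 0) > (1, 7))
```

The chain of types in the last line is the one that appears in Example 7, where each step of the induction strictly lowers the type.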

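Lemma 6.5. Let $\mathcal{P} = \{p_1, \ldots, p_k\}$ be a family of non-constant essentially distinct polynomials with $r$ parameters such that $\deg(\mathcal{P}) = \deg(p_1)$, $(X, \mathcal{B}, \mu, T)$ be a system, and $f_1, \ldots, f_k \in L^\infty(\mu)$ with $\|f_i\|_{L^\infty(\mu)} \le 1$ for $i = 1, \ldots, k$.

Then there exist $s \in \mathbb{N}$, depending only on the type of $\mathcal{P}$, and $\tilde r \in \mathbb{N}$ depending only on $r$ and the type of $\mathcal{P}$, a family of non-constant essentially distinct linear polynomials $\{q_1, \ldots, q_s\}$ with $r + \tilde r$ parameters, and functions $g_1, \ldots, g_s \in L^\infty(\mu)$ (independent of $h$ and $\tilde h$) with $\|g_i\|_{L^\infty(\mu)} \le 1$ and $g_1 = f_1$, such that for every $h \in \mathbb{Z}^r$ we have

(37) $$\Big\| \frac{1}{N} \sum_{n=1}^{N} T^{p_1(h,n)} f_1 \cdot \ldots \cdot T^{p_k(h,n)} f_k \Big\|_{L^2(\mu)}^{2^{\tilde r}} \ll \frac{1}{|\tilde H|} \sum_{1 \le \tilde h \le \tilde H} \Big\| \frac{1}{N} \sum_{n=1}^{N} T^{q_1(h,\tilde h,n)} g_1 \cdot \ldots \cdot T^{q_s(h,\tilde h,n)} g_s \Big\|_{L^2(\mu)} + o_{N,\tilde H,\tilde H\prec N}(1)$$

where $\tilde H = (\tilde H_1, \ldots, \tilde H_{\tilde r})$, and the implied constant depends only on the type of the polynomial family $\mathcal{P}$.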


Proof. We remark that throughout the proof all the implied constants depend only on the type of the polynomial family $\mathcal{P}$.

We shall use induction on the type of the polynomial family $\mathcal{P}$. Assume that the statement holds for all polynomial families with several parameters and type less than $(d, w_d, \ldots, w_1)$, and suppose that $\{p_1, \ldots, p_k\}$ is a polynomial family with $r$ parameters and type $(d, w_d, \ldots, w_1)$. Let $d_0$ be the first positive integer for which $w_{d_0} \neq 0$. Without loss of generality we can assume that the polynomial $p_k$ has minimal degree and is such that the polynomials $p_i(h,n) - p_k(h,n)$, $p_j(h,n+h_1) - p_k(h,n)$, for $i = 1, \ldots, k-1$, $j = 1, \ldots, k$, have degree less than or equal to the degree of the polynomial $p_1(h,n) - p_k(h,n)$. Furthermore, we can assume that $\deg(p_1) \ge 2$, otherwise all polynomials are already linear (since $p_1$ has maximal degree), in which case there is nothing to prove. In order to carry out the inductive step we consider two cases.

Case 1. Suppose that $\deg(p_k) = d_0 \ge 2$. Let $\tilde r \in \mathbb{N}$ be the integer that is determined by the induction hypothesis. We are going to estimate



(38) $$\Big\| \frac{1}{N} \sum_{n=1}^{N} T^{p_1(h,n)} f_1 \cdot \ldots \cdot T^{p_k(h,n)} f_k \Big\|_{L^2(\mu)}^{2^{\tilde r + 1}}.$$

We use Lemma 4.1, then factor out the measure preserving transformation $T^{p_k(h,n)}$ from the resulting integrals, and use the Cauchy-Schwarz inequality. Since the sup norm of all functions is bounded by $1$, we find that for every $h \in \mathbb{Z}^r$ the expression in (38) is bounded by some constant times

(39) $$\frac{1}{H_1} \sum_{h_1=1}^{H_1} \Big\| \frac{1}{N} \sum_{n=1}^{N} T^{\tilde p_1(h_1,h,n)} \tilde f_1 \cdot \ldots \cdot T^{\tilde p_l(h_1,h,n)} \tilde f_l \Big\|_{L^2(\mu)}^{2^{\tilde r}} + o_{N,H_1,H_1\prec N}(1),$$

where $l = 2k - 1$, $\tilde f_1 = f_1$, $\tilde p_1(h_1,h,n) = p_1(h,n) - p_k(h,n)$, the functions $\tilde f_1, \ldots, \tilde f_l$ are bounded by $1$ and do not depend on the parameters $h_1, h$, and the polynomials $\{\tilde p_2, \ldots, \tilde p_l\}$ have the form

$$p_i(h,n) - p_k(h,n), \quad \text{or} \quad p_j(h,n+h_1) - p_k(h,n)$$

for some $i = 2, \ldots, k-1$ and $j = 1, \ldots, k$. It is easy to verify that $\{\tilde p_1, \ldots, \tilde p_l\}$ is a family of non-constant essentially distinct polynomials with $r + 1$ parameters, type strictly smaller than the type of the family $\mathcal{P}$, and the polynomial $\tilde p_1$ has maximal degree. Therefore, we can apply the induction hypothesis to give a bound for the norm that appears in (39). Putting these estimates together we produce the desired bound, completing the inductive step.

Case 2. Suppose that $\deg(p_k) = d_0 = 1$. After possibly rearranging the polynomials $p_1, \ldots, p_{k-1}$ we can assume that $\deg p_i = 1$ if and only if $i \ge k_0$, for some $k_0$ that satisfies $2 \le k_0 \le k$. Notice that since the polynomials $p_i$ are linear for $i \ge k_0$, for every $h_1 \in \mathbb{N}$ we have

(40) $$p_k(n+h_1) - p_k(n) = p_k(h_1), \qquad p_i(n+h_1) - p_k(n) = p_i(n) - p_k(n) + p_i(h_1) \quad \text{for } i = k_0, \ldots, k-1.$$

Arguing as in Case 1 and keeping in mind the identities (40) we get the estimate

(41) $$\Big\| \frac{1}{N} \sum_{n=1}^{N} T^{p_1(h,n)} f_1 \cdot \ldots \cdot T^{p_k(h,n)} f_k \Big\|_{L^2(\mu)}^{2} \ll \frac{1}{H_1} \sum_{h_1=1}^{H_1} \Big\| \frac{1}{N} \sum_{n=1}^{N} T^{\tilde p_1(h_1,h,n)} \tilde f_1 \cdot \ldots \cdot T^{\tilde p_l(h_1,h,n)} \tilde f_l \Big\|_{L^2(\mu)} + o_{N,H_1,H_1\prec N}(1),$$

where $l = k + k_0 - 2$, $\tilde f_1 = f_1$, $\tilde p_1(h_1,h,n) = p_1(h,n) - p_k(h,n)$, and the polynomials $\tilde p_2, \ldots, \tilde p_l$ have the form

$$p_i(h,n) - p_k(h,n), \quad \text{or} \quad p_j(h,n+h_1) - p_k(h,n)$$

for some $i = 2, \ldots, k-1$ and $j = 1, \ldots, k_0 - 1$. It is easy to verify that $\{\tilde p_1, \ldots, \tilde p_l\}$ is a family of non-constant essentially distinct polynomials with $r + 1$ parameters and type strictly smaller than the type of the family $\mathcal{P}$.

Note that in this case some of the functions $\tilde f_i$ may depend on $h_1$, but this can happen only for those indices $i$ for which $\deg(\tilde p_i) = 1$. In order to get rid of these functions we use Lemma 4.1 again to get a bound for the expression

$$\Big\| \frac{1}{N} \sum_{n=1}^{N} T^{\tilde p_1(h_1,h,n)} \tilde f_1 \cdot \ldots \cdot T^{\tilde p_l(h_1,h,n)} \tilde f_l \Big\|_{L^2(\mu)}$$

that involves one less linear term. After repeating this step a finite number of times ($w_1 - 1$ in total), we eventually get an expression without any linear terms (see Example 7). Combining this with the estimate (41) we get

(42) $$\Big\| \frac{1}{N} \sum_{n=1}^{N} T^{p_1(h,n)} f_1 \cdot \ldots \cdot T^{p_k(h,n)} f_k \Big\|_{L^2(\mu)}^{2^{w_1}} \ll \frac{1}{|\tilde H|} \sum_{1 \le \tilde h \le \tilde H} \Big\| \frac{1}{N} \sum_{n=1}^{N} T^{\tilde p_1(\tilde h,h,n)} \tilde f_1 \cdot \ldots \cdot T^{\tilde p_l(\tilde h,h,n)} \tilde f_l \Big\|_{L^2(\mu)} + o_{N,\tilde H,\tilde H\prec N}(1)$$

for some $l \in \mathbb{N}$, where $\tilde h = (h_1, \ldots, h_{w_1})$, $\tilde H = (H_1, \ldots, H_{w_1})$. The family of non-constant essentially distinct polynomials $\{\tilde p_1, \ldots, \tilde p_l\}$ has $r + w_1$ parameters, type strictly smaller than the type of the family $\mathcal{P}$, and the functions $\tilde f_1, \ldots, \tilde f_l$ are bounded by $1$ and do not depend on the parameters $\tilde h, h$. We can now use the induction hypothesis, as in Case 1, to carry out the inductive step and complete the proof. □

We illustrate the method used in the previous proof with the following example:

Example 7. We start with the polynomial family $\{n, 2n, n^2\}$ that has type $(2, 1, 2)$ and study the corresponding multiple ergodic averages

$$\frac{1}{N} \sum_{n=1}^{N} T^{n^2} f_1 \cdot T^{2n} f_2 \cdot T^{n} f_3.$$

Since this expression involves linear terms, we perform the operation described in Case 2. We are led to study an average over $h_1$ of a power of the $L^2$ norms of the averages

$$\frac{1}{N} \sum_{n=1}^{N} T^{(n+h_1)^2 - n} \tilde f_1 \cdot T^{n^2 - n} \tilde f_2 \cdot T^{n} \tilde f_3$$

which involves a polynomial family with 1 parameter that has type $(2, 1, 1)$. We perform one more time the operation described in Case 2. We are led to study an average over $h_1, h_2$ of a power of the $L^2$ norms of the averages

$$\frac{1}{N} \sum_{n=1}^{N} T^{(n+h_1+h_2)^2 - 2n - h_2} \tilde f_1 \cdot T^{(n+h_1)^2 - 2n} \tilde f_2 \cdot T^{(n+h_2)^2 - 2n - h_2} \tilde f_3 \cdot T^{n^2 - 2n} \tilde f_4$$

which involves a polynomial family with 2 parameters that has type $(2, 1, 0)$. Since the resulting expression has no linear terms, we perform the operation described in Case 1. We are led to study an average over $h_1, h_2, h_3$ of the $L^2$ norms of the averages

$$\frac{1}{N} \sum_{n=1}^{N} T^{2(h_1+h_2+h_3)n + (h_1+h_2+h_3)^2 - h_2 - 2h_3} \tilde f_1 \cdot T^{2(h_1+h_2)n + (h_1+h_2)^2 - h_2} \tilde f_2 \cdot T^{2(h_1+h_3)n + (h_1+h_3)^2 - 2h_3} \tilde f_3 \cdot T^{2h_1 n + h_1^2} \tilde f_4$$
$$\qquad \cdot\; T^{2(h_2+h_3)n + (h_2+h_3)^2 - h_2 - 2h_3} \tilde f_5 \cdot T^{2h_2 n + h_2^2 - h_2} \tilde f_6 \cdot T^{2h_3 n + h_3^2 - 2h_3} \tilde f_7$$

which involves a family of linear polynomials with 3 parameters that has type $(1, 7)$. Here $\tilde f_i$ denote bounded functions that change from one display to the next.
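The last step of Example 7 can be checked mechanically. Reading the type $(2,1,0)$ family of the example as $(n+h_1+h_2)^2 - 2n - h_2$, $(n+h_1)^2 - 2n$, $(n+h_2)^2 - 2n - h_2$, $n^2 - 2n$, the sketch below applies the Case 1 operations $p(n+h_3) - p_k(n)$ and $p(n) - p_k(n)$ with $p_k = n^2 - 2n$ and compares the result with the seven linear polynomials of the final display. The function names are ours and the comparison is done on integer sample points only.

```python
from itertools import product


def quadratic_family(n: int, h1: int, h2: int) -> list[int]:
    """Values at n of the four degree-2 polynomials of type (2, 1, 0)."""
    return [
        (n + h1 + h2) ** 2 - 2 * n - h2,
        (n + h1) ** 2 - 2 * n,
        (n + h2) ** 2 - 2 * n - h2,
        n ** 2 - 2 * n,
    ]


def case1_step(n: int, h1: int, h2: int, h3: int) -> list[int]:
    """Form p(n + h3) - p_k(n) and p(n) - p_k(n) for each p, with p_k = n^2 - 2n."""
    base = quadratic_family(n, h1, h2)
    shifted = quadratic_family(n + h3, h1, h2)
    p_k = base[-1]
    out = []
    for s, b in zip(shifted, base):
        out.append(s - p_k)
        out.append(b - p_k)
    return out[:-1]  # the last entry is p_k - p_k = 0 and is dropped


def linear_family(n: int, h1: int, h2: int, h3: int) -> list[int]:
    """Values at n of the seven linear polynomials in the last display of Example 7."""
    return [
        2 * (h1 + h2 + h3) * n + (h1 + h2 + h3) ** 2 - h2 - 2 * h3,
        2 * (h1 + h2) * n + (h1 + h2) ** 2 - h2,
        2 * (h1 + h3) * n + (h1 + h3) ** 2 - 2 * h3,
        2 * h1 * n + h1 ** 2,
        2 * (h2 + h3) * n + (h2 + h3) ** 2 - h2 - 2 * h3,
        2 * h2 * n + h2 ** 2 - h2,
        2 * h3 * n + h3 ** 2 - 2 * h3,
    ]


points = list(product(range(-3, 4), range(1, 4), range(1, 4), range(1, 4)))
print(all(case1_step(*pt) == linear_family(*pt) for pt in points))
```

The seven leading coefficients $2(h_1+h_2+h_3)$, $2(h_1+h_2)$, $2(h_1+h_3)$, $2h_1$, $2(h_2+h_3)$, $2h_2$, $2h_3$ are pairwise distinct as polynomials in $h$, which is what the type $(1, 7)$ records.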

Next we use the previous lemma to bound some multiple ergodic averages involving a col- 
lection of polynomials of a single variable with some multiple ergodic averages involving a 
collection of polynomials of several variables that have some convenient special form. 
Lemma 6.6. Let $\mathcal{P} = \{p_1, \ldots, p_k\}$ be a family of non-constant essentially distinct polynomials, $(X, \mathcal{B}, \mu, T)$ be a system, and $f_0, f_1, \ldots, f_k \in L^\infty(\mu)$ with $\|f_i\|_{L^\infty(\mu)} \le 1$.

Then there exist $r, s \in \mathbb{N}$, non-constant essentially distinct polynomials $P_1, \ldots, P_s\colon \mathbb{Z}^r \to \mathbb{Z}$ that are independent of $n$ and each $P_i$ is an integer combination of polynomials of the form $p_i(n + \sum_{j\in J} h_j)$ where $J$ is some subset (possibly empty) of $\{1, \ldots, r\}$, and functions $F_0, F_1, \ldots, F_s \in L^\infty(\mu)$, such that $F_1 = f_1$, and

$$\Big| \frac{1}{N} \sum_{n=1}^{N} \int f_0 \cdot T^{p_1(n)} f_1 \cdot \ldots \cdot T^{p_k(n)} f_k \; d\mu \Big|^{2^r} \ll \frac{1}{|H|} \sum_{1 \le h \le H} \Big| \int F_0 \cdot T^{P_1(h)} F_1 \cdot \ldots \cdot T^{P_s(h)} F_s \; d\mu \Big| + o_{N,H,H\prec N}(1)$$

where $H = (H_1, \ldots, H_r)$ and the implied constant depends only on the type of the polynomial family $\mathcal{P}$.

Remark. Our assumptions force the polynomials $P_1, \ldots, P_s$ to have a very special form; we are going to take advantage of this property later.

Proof. We remark that throughout the proof all the implied constants depend only on the type of the polynomial family $\mathcal{P}$.

We are going to estimate the quantity

$$\Big| \frac{1}{N} \sum_{n=1}^{N} \int f_0 \cdot T^{p_1(n)} f_1 \cdot \ldots \cdot T^{p_k(n)} f_k \; d\mu \Big|^{2^r}$$

for some appropriate choice of $r$. We can assume that the polynomial $p_1$ has maximal degree. Indeed, if this is not the case, we can factor out the measure preserving transformation $T^{p_{i_0}(n)}$ where $p_{i_0}$ is some polynomial of maximal degree and work with the resulting family.

Using the Cauchy-Schwarz inequality and Lemma 6.5 we get that there exist $r_1, s_1 \in \mathbb{N}$, depending only on the type of $\mathcal{P}$, a family of non-constant essentially distinct linear polynomials $q_1, \ldots, q_{s_1}$ with $r_1$ parameters, and functions $\tilde f_1, \ldots, \tilde f_{s_1} \in L^\infty(\mu)$, such that $\tilde f_1 = f_1$ and

(43) $$\Big| \frac{1}{N} \sum_{n=1}^{N} \int f_0 \cdot T^{p_1(n)} f_1 \cdot \ldots \cdot T^{p_k(n)} f_k \; d\mu \Big|^{2^{r_1}} \ll \frac{1}{|H|} \sum_{1 \le h \le H} \Big\| \frac{1}{N} \sum_{n=1}^{N} T^{q_1(h,n)} \tilde f_1 \cdot \ldots \cdot T^{q_{s_1}(h,n)} \tilde f_{s_1} \Big\|_{L^2(\mu)} + o_{N,H,H\prec N}(1)$$

where $H = (H_1, \ldots, H_{r_1})$. Since all the polynomials are linear, arguing as in the proof of Lemma 4.2 we get that there exist $r_2, s_2 \in \mathbb{N}$, non-constant essentially distinct polynomials $P_1, \ldots, P_{s_2}\colon \mathbb{Z}^{r_2} \to \mathbb{Z}$, and functions $F_0, \ldots, F_{s_2} \in L^\infty(\mu)$, such that $F_1 = \tilde f_1 = f_1$, and

(44) $$\Big\| \frac{1}{N} \sum_{n=1}^{N} T^{q_1(h,n)} \tilde f_1 \cdot \ldots \cdot T^{q_{s_1}(h,n)} \tilde f_{s_1} \Big\|_{L^2(\mu)}^{2^{r_2}} \ll \frac{1}{|\tilde H|} \sum_{1 \le \tilde h \le \tilde H} \Big| \int F_0 \cdot T^{P_1(h,\tilde h)} F_1 \cdot \ldots \cdot T^{P_{s_2}(h,\tilde h)} F_{s_2} \; d\mu \Big| + o_{N,\tilde H,\tilde H\prec N}(1)$$

where $\tilde H = (\tilde H_1, \ldots, \tilde H_{r_2})$. Combining (43) and (44), and noticing that $\{P_1(h,\tilde h), \ldots, P_{s_2}(h,\tilde h)\}$ is a family of non-constant essentially distinct polynomials $\mathbb{Z}^{r_1+r_2} \to \mathbb{Z}$, we get the advertised estimate with $r = r_1 + r_2$. Furthermore, looking at the proofs of Lemma 6.5 and Lemma 4.2 we see that the polynomials $P_i$ are constructed starting from the family $\{p_1, \ldots, p_k\}$ and performing $r$ times one of the following two operations: (i) form the polynomial $p(n) - q(n)$ or (ii) form the polynomial $p(n + h_i) - q(n)$, where $p$ and $q$ are already defined polynomials and $i \in \{1, \ldots, r\}$. It follows that the polynomials $P_i$ have the advertised form. This completes the proof. □

6.2.2. Conclusion of the reduction. We shall now use Lemma 6.6 to show that the nilfactor is characteristic for the multiple ergodic averages related to the multiple recurrence problem of Theorem 6.2. We first need a simple lemma:

Lemma 6.7. Fix $k \in \mathbb{N}$. Let $(p_m)_{m\in\mathbb{N}}$ be a sequence of integer polynomials of the form $p_m(n) = mn^k + q_m(n)$ where $\deg(q_m) \le k - 1$ and for $l_i, h_i \in \mathbb{Z}$ let

$$P_{m,h_1,\ldots,h_r}(n) = \sum_{i=1}^{r} l_i\, p_m(n + h_i).$$

Then the leading coefficient of $P_{m,h_1,\ldots,h_r}(n)$ has the form $m \cdot P(h_1, \ldots, h_r)$ for some polynomial $P\colon \mathbb{Z}^r \to \mathbb{Z}$.

Proof. For every choice of integers $l_i, h_i$, and positive integer $j$ with $j < k$, the degree of the polynomial $\sum_{i=1}^{r} l_i (n + h_i)^k$ is greater than the degree of the polynomial $\sum_{i=1}^{r} l_i (n + h_i)^j$ (up to a constant, we get the second polynomial by differentiating the first several times), as long as the two polynomials are not identically zero. Hence, the leading coefficient of the polynomial $P_{m,h_1,\ldots,h_r}(n)$ is the same as the leading coefficient of the polynomial $m \cdot \sum_{i=1}^{r} l_i (n + h_i)^k$. The result follows. □
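A small computation illustrates Lemma 6.7 for $k = 2$: the leading coefficient of $p_m(n+h_1) - p_m(n+h_2)$ does not see the lower order part $q_m$. Polynomials are stored as coefficient lists `[a_0, a_1, ..., a_d]`; this representation, the sample values $m = 3$, $(h_1, h_2) = (4, 1)$, $(l_1, l_2) = (1, -1)$ and the function names are assumptions made for the example.

```python
from math import comb


def shift(p: list[int], h: int) -> list[int]:
    """Coefficients of p(n + h), via the binomial expansion of (n + h)^j."""
    out = [0] * len(p)
    for j, coeff in enumerate(p):
        for i in range(j + 1):
            out[i] += coeff * comb(j, i) * h ** (j - i)
    return out


def combination(p: list[int], ls: tuple[int, ...], hs: tuple[int, ...]) -> list[int]:
    """Coefficients of sum_i l_i * p(n + h_i)."""
    out = [0] * len(p)
    for l, h in zip(ls, hs):
        for i, coeff in enumerate(shift(p, h)):
            out[i] += l * coeff
    return out


def leading(p: list[int]):
    """(degree, leading coefficient) of p, or None if p is identically zero."""
    for i in range(len(p) - 1, -1, -1):
        if p[i] != 0:
            return i, p[i]
    return None


m = 3
p_first = [5, 7, m]    # p_m(n) = 3n^2 + 7n + 5
p_second = [9, -2, m]  # same m, different lower order part q_m

print(leading(combination(p_first, (1, -1), (4, 1))))
print(leading(combination(p_second, (1, -1), (4, 1))))
```

In both cases the leading term is $2m(h_1 - h_2)\,n$, that is, $m \cdot P(h_1, h_2)$ with $P(h_1, h_2) = 2(h_1 - h_2)$.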

Proof of Proposition 6.3. We remark that throughout the proof all the implied constants depend only on the type of the polynomial family $\mathcal{P}$.

Without loss of generality we can assume that $\|f_i\|_{L^\infty(\mu)} \le 1$ for $i = 0, 1, \ldots, \ell$, and $f_1 \perp \mathcal{Z}$. Let

$$A_M = \frac{1}{M} \sum_{m=1}^{M} \Big| \frac{1}{N_m} \sum_{n=1}^{N_m} \int f_0 \cdot T^{p_m(n)} f_1 \cdot T^{2p_m(n)} f_2 \cdot \ldots \cdot T^{\ell p_m(n)} f_\ell \; d\mu \Big|.$$

We shall use Lemma 6.6 to get a bound for the averages $A_M$ that is independent of the polynomials $q_m$ (remember $p_m(n) = r(mn^k + q_m(n))$).

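Using the Cauchy-Schwarz inequality we have for every $t \in \mathbb{N}$ that

(45) $$|A_M|^{2^t} \le \frac{1}{M} \sum_{m=1}^{M} \Big| \frac{1}{N_m} \sum_{n=1}^{N_m} \int f_0 \cdot T^{p_m(n)} f_1 \cdot T^{2p_m(n)} f_2 \cdot \ldots \cdot T^{\ell p_m(n)} f_\ell \; d\mu \Big|^{2^t}.$$

Letting $t$ be the integer given by Lemma 6.6 (denoted by $r$ there), we have that there exist $s \in \mathbb{N}$, non-constant essentially distinct polynomials $P_{m,1}, \ldots, P_{m,s}\colon \mathbb{Z}^t \to \mathbb{Z}$, and functions $F_0, \ldots, F_s \in L^\infty(\mu)$, such that $F_1 = f_1$, and

(46) $$\Big| \frac{1}{N_m} \sum_{n=1}^{N_m} \int f_0 \cdot T^{p_m(n)} f_1 \cdot T^{2p_m(n)} f_2 \cdot \ldots \cdot T^{\ell p_m(n)} f_\ell \; d\mu \Big|^{2^t} \ll \frac{1}{|H_m|} \sum_{1 \le h \le H_m} \Big| \int F_0 \cdot T^{P_{m,1}(h)} F_1 \cdot \ldots \cdot T^{P_{m,s}(h)} F_s \; d\mu \Big| + o_{N_m,H_m,H_m\prec N_m}(1)$$

where $H_m = (H_{m,1}, \ldots, H_{m,t})$. By Lemma 6.6 the polynomials $P_{m,1}, \ldots, P_{m,s}$ are finite sums of the form $\sum_i l_i\, p_m(n + h'_i)$ where $h'_i = \sum_{j\in I_i} h_j$ for some subsets $I_i$ of $\{1, \ldots, t\}$. Since the polynomials $P_{m,1}, \ldots, P_{m,s}$ are constant in $n$, and $p_m(n) = r(mn^k + q_m(n))$ for some $q_m \in \mathbb{Z}[x]$ with $\deg q_m \le k - 1$, we get from Lemma 6.7 that

$$P_{m,i}(h) = rm \cdot P_i(h),$$

for $i = 1, \ldots, s$, for some non-constant essentially distinct polynomials $P_1, \ldots, P_s\colon \mathbb{Z}^t \to \mathbb{Z}$. Keeping this in mind, and putting together (45) and (46) we find that

$$|A_M|^{2^t} \ll \frac{1}{|\Phi_M|} \sum_{(m,h)\in\Phi_M} \Big| \int F_0 \cdot T^{rmP_1(h)} F_1 \cdot \ldots \cdot T^{rmP_s(h)} F_s \; d\mu \Big| + o_{M,H_m,H_m\prec N_m}(1)$$

where

$$\Phi_M = \{(m,h) \in \mathbb{N}^{t+1}\colon 1 \le m \le M,\ 1 \le h \le H_m\}.$$

We can choose $H_m$ such that $H_m/N_m \to 0$ and $(\Phi_M)_{M\in\mathbb{N}}$ forms a Følner sequence of subsets of $\mathbb{N}^{t+1}$. Since $F_1 = f_1 \perp \mathcal{Z}$, using Corollary 2.2 we get that the last expression converges to zero when $M \to \infty$. This shows that $A_M$ converges to zero in $L^2(\mu)$ as $M \to \infty$ and completes the proof. □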



7. Appendix: Single recurrence for Hardy field sequences 

In this last section we deal with single recurrence properties of Hardy field sequences and 
improve upon the single recurrence versions of Theorems A and A'. 

Sárközy ([44]), and independently Furstenberg ([25]), showed that if a non-constant
polynomial $q \in \mathbb{Z}[x]$ has zero constant term, then the sequence $q(n)$ is good for single
recurrence. More generally, the same is true for (non-constant) sequences of the form $[q(n)]$
where $q \in \mathbb{R}[x]$, with $q(0) = 0$ ([!]). If the constant term of the polynomial
$q \in \mathbb{R}[x]$ is non-zero, then the sequence $[q(n)]$ is still good for single recurrence,
provided that $q$ is not of the form $q = cp + d$ for some $p \in \mathbb{Z}[x]$ and
$c, d \in \mathbb{R}$. In the latter case, determining whether $[q(n)]$ is good for recurrence
depends on intrinsic properties of the polynomial $q$. For example, the sequences $3n + 3$,
$n^2 - 1$, $[\sqrt{5}n + 1]$, $[\sqrt{5}n + 3]$ are good for recurrence, but the sequences
$3n + 1$, $n^2 + 1$, $[\sqrt{5}n + 2]$ are bad for recurrence.
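
The examples above can be checked against the two criteria recorded in the footnote of this section (a sequence $q(n)$ with $q \in \mathbb{Z}[x]$ is good if and only if its values contain multiples of every positive integer; $[an + b]$ is good if and only if $b + ka \in [0,1]$ for some integer $k$). The following is a minimal Python sketch and not part of the paper: the function names and the cutoff `max_modulus` are illustrative assumptions, the linear test assumes $a > 0$, and the polynomial test is only a finite check, so it can refute but never prove the divisibility criterion.

```python
import math

def hits_every_modulus(q, max_modulus=60):
    """Finite test: for each d <= max_modulus, is some q(n) with n in N divisible by d?

    q(n) mod d is periodic in n with period d, so n = 1..d suffices.
    Passing this test is necessary, not sufficient, for the full criterion.
    """
    return all(
        any(q(n) % d == 0 for n in range(1, d + 1))
        for d in range(1, max_modulus + 1)
    )

def floor_linear_is_good(a, b):
    """Criterion for [a*n + b], assuming a > 0: some integer k has b + k*a in [0, 1]."""
    k_low = math.floor(-b / a) - 1          # generous window around the only
    k_high = math.ceil((1 - b) / a) + 1     # integers k that could work
    return any(0.0 <= b + k * a <= 1.0 for k in range(k_low, k_high + 1))

sqrt5 = math.sqrt(5)
for name, verdict in [
    ("3n+3", hits_every_modulus(lambda n: 3 * n + 3)),
    ("n^2-1", hits_every_modulus(lambda n: n * n - 1)),
    ("3n+1", hits_every_modulus(lambda n: 3 * n + 1)),
    ("n^2+1", hits_every_modulus(lambda n: n * n + 1)),
    ("[sqrt5 n+1]", floor_linear_is_good(sqrt5, 1)),
    ("[sqrt5 n+3]", floor_linear_is_good(sqrt5, 3)),
    ("[sqrt5 n+2]", floor_linear_is_good(sqrt5, 2)),
]:
    print(name, "passes" if verdict else "fails")
```

The sequences $3n+1$ and $n^2+1$ already fail at the modulus 3, which is why a small cutoff is enough to separate the examples listed in the text.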

We shall show that if the function $a \in \mathcal{H}$ has polynomial growth and stays away from
polynomials of the form $cp(x)$, where $p \in \mathbb{Z}[x]$ and $c \in \mathbb{R}$, then the
sequence $[a(n)]$ is always good for single recurrence. This is the statement of Theorem C
which we now repeat.

Theorem C. Let $a \in \mathcal{H}$ have polynomial growth and suppose that
$|a(x) - cp(x)| \to \infty$ for every $p \in \mathbb{Z}[x]$ and $c \in \mathbb{R}$.

Then $S = \{[a(1)], [a(2)], \ldots\}$ is a set of single recurrence.

We are going to use the following lemma: 

Lemma 7.1. Suppose that $p \in \mathbb{R}[x]$ is non-constant with leading coefficient $\alpha$,
the real number $\beta$ is such that $1/\beta \notin \mathbb{Q}/\alpha + \mathbb{Q}$, and
$n_m, N_m \in \mathbb{N}$ are such that $N_m \to \infty$ as $m \to \infty$.



Then for every $t \in (0,1)$, and Riemann integrable function $\phi\colon \mathbb{T} \to \mathbb{C}$,
the averages

$$\frac{1}{M}\sum_{m=1}^{M}\Big(\frac{1}{N_m}\sum_{n=n_m}^{n_m+N_m}\phi(p(n)+m\beta)\cdot e\big([p(n)+m\beta]\,t\big)\Big)$$

converge to zero as $M \to \infty$.

It can be shown that the sequence $q(n)$, where $q \in \mathbb{Z}[x]$, is good for recurrence if and
only if the set $\{q(n) : n \in \mathbb{N}\}$ contains multiples of every positive integer ([33]), and
the sequence $[an + b]$, $a, b \in \mathbb{R}$, is good for recurrence if and only if there exists an
integer $k$ such that $b + ka \in [0,1]$ ([!]).

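Proof. Suppose first that $t$ is irrational. We shall show that for every Riemann integrable
function $f$ on $\mathbb{T}^2$ we have

(47)    $$\lim_{M\to\infty}\frac{1}{M}\sum_{m=1}^{M}\Big(\frac{1}{N_m}\sum_{n=n_m}^{n_m+N_m} f\big((p(n)+m\beta)t,\ p(n)+m\beta\big)\Big) = \int f(x,y)\,dx\,dy.$$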



Applying this for $f(x,y) = \phi(y)\cdot e(x - t\{y\})$, and noticing that the integral of $f$ is
zero, we get the advertised result.

Next we verify (47). Using a standard approximation argument by trigonometric polynomials
we can assume that $f(x,y) = e(l_1x + l_2y)$ for some $l_1, l_2 \in \mathbb{Z}$ not both of them 0.
So it suffices to show that

(48)    $$\lim_{M\to\infty}\frac{1}{M}\sum_{m=1}^{M} e\big(m(l_1t + l_2)\beta\big)A_m = 0$$

where 



$$A_m = \frac{1}{N_m}\sum_{n=n_m}^{n_m+N_m} e\big(l_1p(n)t + l_2p(n)\big).$$



Suppose first that $l_1 = 0$. Then $l_2 \ne 0$, and since the sequence $A_m$ is known to converge
and $\beta$ is irrational, we get that the limit in (48) is 0.

Suppose now that $l_1 \ne 0$. Notice that the leading coefficient of the polynomial
$l_1p(n)t + l_2p(n)$ is $(l_1t + l_2)\alpha$ and this is irrational if and only if
$t \notin \mathbb{Q}/\alpha + \mathbb{Q}$. If this happens to be the case, then $A_m \to 0$ and (48)
follows. Otherwise we have $t = q_1/\alpha + q_2$ for some $q_1, q_2 \in \mathbb{Q}$. Since $t$ is
assumed to be irrational we have $q_1 \ne 0$ and $\alpha$ is irrational. Since the sequence $A_m$
is known to converge, the limit in (48) is equal to a constant times



(49)    $$\lim_{M\to\infty}\frac{1}{M}\sum_{m=1}^{M} e\big(m(l_1q_1/\alpha + l_1q_2 + l_2)\beta\big).$$

We can rewrite $(l_1q_1/\alpha + l_1q_2 + l_2)\beta$ as $(\tilde q_1/\alpha + \tilde q_2)\beta$ for
some $\tilde q_1, \tilde q_2 \in \mathbb{Q}$ with $\tilde q_1 \ne 0$. Since $\alpha$ is irrational, this
last expression is non-zero, and as a consequence the limit in (49) is going to be 0 unless
$(\tilde q_1/\alpha + \tilde q_2)\beta$ is a non-zero integer. But this cannot be the case since by
our assumption $1/\beta \notin \mathbb{Q}/\alpha + \mathbb{Q}$. Therefore, the limit in (49) is 0,
completing the proof of (47).
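
The dichotomy used for (49), namely that the averages of $e(m\theta)$ vanish in the limit unless $\theta$ is an integer, can be illustrated numerically. The sketch below is an illustration only and not the authors' argument: it assumes the standard notation $e(x) = \exp(2\pi i x)$, and the function names and the sample value of `theta` are hypothetical choices. It compares the average with the elementary geometric-sum bound $1/(M|\sin \pi\theta|)$.

```python
import cmath
import math

def e(x):
    """The character e(x) = exp(2*pi*i*x)."""
    return cmath.exp(2j * math.pi * x)

def weyl_average(theta, M):
    """(1/M) * sum_{m=1}^{M} e(m*theta)."""
    return sum(e(m * theta) for m in range(1, M + 1)) / M

def geometric_bound(theta, M):
    """Upper bound 1/(M*|sin(pi*theta)|), valid when theta is not an integer."""
    return 1.0 / (M * abs(math.sin(math.pi * theta)))

# Hypothetical sample: theta = (q1/alpha + q2)*beta with alpha = sqrt(2),
# beta = sqrt(3), q1 = 1, q2 = 1/3, so theta is not an integer.
theta = (1 / math.sqrt(2) + 1 / 3) * math.sqrt(3)
for M in (10, 100, 1000, 10000):
    print(M, abs(weyl_average(theta, M)), geometric_bound(theta, M))

# For an integer theta every term equals 1, so the average does not decay.
print(abs(weyl_average(2.0, 1000)))
```

The decay rate $1/M$ in the bound depends on how close $\theta$ is to an integer, which is why the proof only needs $\theta \notin \mathbb{Z}$ and no quantitative information.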

It remains to deal with the case where $t \in (0,1)$ is rational. For convenience we shall assume
that $t = 1/k$ for some integer $k$ with $k \ge 2$ (if the numerator of $t$ is not 1 the argument is
similar). Using a standard approximation argument we can assume that $\phi(y) = e(ly)$ for some
$l \in \mathbb{Z}$. Then the limit in question becomes

$$\lim_{M\to\infty}\frac{1}{M}\sum_{m=1}^{M}\frac{1}{N_m}\sum_{n=n_m}^{n_m+N_m} e\big(l(p(n)+m\beta) + [p(n)+m\beta]/k\big).$$

Since $[y] = y - \{y\}$, the summand equals $e\big((l+1/k)y - \{y\}/k\big)$ with $y = p(n)+m\beta$.
Arguing as in the proof of (47) we can show that the sequence
$\big((l+1/k)(p(n)+m\beta),\ p(n)+m\beta\big)$ is equidistributed in the subset
$H = \{((lk+1)x, kx) : x \in \mathbb{T}\}$ of $\mathbb{T}^2$ with respect to the sequence
$(S_M)_{M\in\mathbb{N}}$, where

$$S_M = \{(m,n)\in\mathbb{Z}^2 : 1\le m\le M,\ n_m\le n\le n_m+N_m\}.$$

Therefore, the last limit is equal to

$$\int_0^1 e\big((lk+1)x - \{kx\}/k\big)\,dx = \sum_{i=0}^{k-1}\int_{i/k}^{(i+1)/k} e\big((lk+1)x - (kx-i)/k\big)\,dx.$$



Using the change of variables $x \mapsto y + i/k$ we see that the last expression is equal to

(50)    $$\int_0^{1/k} e(lky)\,dy\ \cdot\ \sum_{i=0}^{k-1} e\big((lk+1)i/k\big).$$

Since $l \in \mathbb{Z}$ and $k \ge 2$ we have $lk + 1 \ne 0$; in fact $lk + 1$ is not divisible by
$k$, so $e((lk+1)/k) \ne 1$. It follows that the (geometric) sum appearing in (50) is zero.
This completes the proof. □
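
The vanishing used in the rational case can be sanity-checked numerically. The following is a small Python sketch under stated assumptions, not part of the proof: it takes $e(x) = \exp(2\pi i x)$, the function names are hypothetical, and the integral is approximated by a midpoint rule whose number of nodes is a multiple of $k$, so that the nodes split evenly over the intervals $[i/k, (i+1)/k)$.

```python
import cmath
import math

def e(x):
    """The character e(x) = exp(2*pi*i*x)."""
    return cmath.exp(2j * math.pi * x)

def root_sum(l, k):
    """The sum over i = 0..k-1 of e((l*k + 1) * i / k)."""
    return sum(e((l * k + 1) * i / k) for i in range(k))

def midpoint_integral(l, k, points_per_piece=200):
    """Midpoint-rule approximation of the integral over [0, 1] of
    e((l*k + 1)*x - {k*x}/k), where {.} denotes the fractional part."""
    n_points = k * points_per_piece
    total = 0j
    for j in range(n_points):
        x = (j + 0.5) / n_points
        frac_kx = (k * x) % 1.0
        total += e((l * k + 1) * x - frac_kx / k)
    return total / n_points

for l in (-2, 0, 3):
    for k in (2, 3, 5):
        print(l, k, abs(root_sum(l, k)), abs(midpoint_integral(l, k)))
```

Both printed quantities should be zero up to floating-point rounding: with this choice of nodes the midpoint sum factors through the same sum of roots of unity as the integral itself.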

Proof of Theorem C. If $|a(x) - cp(x)| \succ \log x$ for every $p \in \mathbb{Z}[x]$ and
$c \in \mathbb{R}$, then by [17] the sequence $([a(n)])_{n\in\mathbb{N}}$ is good for the ergodic
theorem, and the result follows.

Therefore, we can assume that $a(x) = cp(x) + b(x)$ for some $p \in \mathbb{Z}[x]$,
$c \in \mathbb{R}$, and $b \in \mathcal{H}$ that satisfies $1 \prec b(x) \prec x$. Furthermore, we can
assume that $b(x) > 0$ for large $x$ (the other case can be treated similarly). If $c = 0$, then the
range of the sequence $([b(n)])_{n\in\mathbb{N}}$ contains all large enough integers (see Lemma 5.2)
and so forms a set of recurrence. If $c \ne 0$, let $\beta \in \mathbb{R}$ be such that
$1/\beta \notin \mathbb{Q}/c + \mathbb{Q}$. Since $b(x) \prec x$ we have $b(x+1) - b(x) \to 0$ (see
Lemma 5.2), and since $b(x) \to \infty$ it follows that there exist integers $N_m$ with
$N_m \to \infty$ and intervals $I_m = [n_m, n_m + N_m]$, where $n_m \in \mathbb{N}$, such that

(51)    $$\lim_{m\to\infty}\ \sup_{n\in I_m} |b(n) - m\beta| = 0.$$
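
To make the choice of intervals concrete, here is a hedged Python sketch for one sample function. Everything specific in it is an assumption made for illustration and does not come from the paper: the function $b(x) = \sqrt{x}$ (which satisfies $1 \prec b(x) \prec x$), the value $\beta = \sqrt{3}$, and the particular choices $n_m = \lceil (m\beta)^2 \rceil$ and $N_m = \lfloor \beta\sqrt{m} \rfloor$.

```python
import math

def interval(m, beta):
    """Hypothetical choice of (n_m, N_m) for b(x) = sqrt(x).

    n_m is the first integer with sqrt(n_m) >= m*beta; N_m grows like sqrt(m).
    """
    n_m = math.ceil((m * beta) ** 2)
    N_m = math.floor(beta * math.sqrt(m))
    return n_m, N_m

def sup_deviation(m, beta, b=math.sqrt):
    """sup over n in [n_m, n_m + N_m] of |b(n) - m*beta|."""
    n_m, N_m = interval(m, beta)
    return max(abs(b(n) - m * beta) for n in range(n_m, n_m + N_m + 1))

beta = math.sqrt(3)
for m in (10, 100, 1000, 10000):
    n_m, N_m = interval(m, beta)
    print(m, n_m, N_m, sup_deviation(m, beta))
```

For this sample the deviation is at most $(N_m + 1)/(2m\beta)$, which is of order $m^{-1/2}$, so the interval lengths tend to infinity while the supremum in (51) tends to zero.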
Let

(52)    $$J = \{(m,n)\in\mathbb{N}^2 : \{cp(n) + m\beta\} \in [1/2, 3/4]\}.$$

Consider the sequence $(S_M)_{M\in\mathbb{N}}$ where

$$S_M = \{(m,n)\in\mathbb{N}^2 : 1\le m\le M,\ n_m\le n\le n_m+N_m\}.$$

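Notice that because of (51) and (52), for $(m,n) \in S_M \cap J$ with $m$ big enough, we have
$[cp(n) + b(n)] = [cp(n) + m\beta]$. Applying Lemma 7.1 for $\phi = 1_{[1/2,3/4]}$ and noticing
that the set $J$ has positive density ($= 1/4$) with respect to the sequence
$(S_M)_{M\in\mathbb{N}}$, we have for every $t \in (0,1)$ that

$$\lim_{M\to\infty}\frac{1}{|S_M\cap J|}\sum_{(m,n)\in S_M\cap J} e\big([cp(n)+b(n)]t\big) = \lim_{M\to\infty}\frac{1}{|S_M\cap J|}\sum_{(m,n)\in S_M\cap J} e\big([cp(n)+m\beta]t\big) = 0.$$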
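
The identity $[cp(n) + b(n)] = [cp(n) + m\beta]$ rests on an elementary fact: if $\{y\} \in [1/2, 3/4]$ and $|\delta| < 1/4$, then $[y + \delta] = [y]$. A minimal Python check of this fact is sketched below; the function names and the grid parameters are illustrative assumptions, with $y$ playing the role of $cp(n) + m\beta$ and $\delta$ the role of $b(n) - m\beta$.

```python
import math

def floors_agree(y, delta):
    """True when the integer parts of y and y + delta coincide."""
    return math.floor(y + delta) == math.floor(y)

def check_grid(steps=50, delta_max=0.2):
    """Test all y with fractional part on a grid in [1/2, 3/4] and all
    perturbations delta on a grid in [-delta_max, delta_max]."""
    for integer_part in (-3, 0, 7):
        for i in range(steps + 1):
            y = integer_part + 0.5 + 0.25 * i / steps
            for j in range(-steps, steps + 1):
                delta = delta_max * j / steps
                if not floors_agree(y, delta):
                    return False
    return True

print(check_grid())             # the restricted window keeps the integer part stable
print(floors_agree(0.9, 0.2))   # outside the window the integer part can jump
```

This is the reason for restricting to the set $J$: away from the window $[1/2, 3/4]$ a small perturbation can push the value across an integer.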

Using this and the spectral theorem we get that

$$\lim_{M\to\infty}\frac{1}{|S_M\cap J|}\sum_{(m,n)\in S_M\cap J}\int f\cdot T^{[a(n)]}f\,d\mu \ \ge\ \Big(\int f\,d\mu\Big)^2$$



for every $f \in L^\infty(\mu)$. This implies the result and completes the proof. □

This is an easy consequence of the spectral theorem and the following equidistribution result of
Boshernitzan [16]: If $a \in \mathcal{H}$ has polynomial growth and satisfies
$|a(x) - p(x)| \succ \log x$ for every $p \in \mathbb{Q}[x]$, then the sequence
$(a(n))_{n\in\mathbb{N}}$ is equidistributed in $\mathbb{T}$.